HDAC9 POLYPEPTIDES AND POLYNUCLEOTIDES AND USES THEREOF



RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Application No.
60/298,173, filed on June 14, 2001, U.S. Provisional Application No. 60/311,686,
filed on August 10, 2001, and U.S. Provisional Application No. 60/316,995, filed
in September 2001. The entire teachings of the above applications are
incorporated herein by reference.

GOVERNMENT SUPPORT

The invention was supported, in whole or in part, by grant CA-0974823 from 
the National Cancer Institute. The Government has certain rights in the invention. 

BACKGROUND OF THE INVENTION

The N-terminal tails of core histones undergo covalent post-translational
modifications, including acetylation and phosphorylation. Evidence suggests
that these covalent modifications play important roles in several biological
activities involving chromatin, e.g., transcription and replication. Histone
deacetylases (HDACs) catalyze the removal of the acetyl group from the lysine
residues in the N-terminal tails of nucleosomal core histones, resulting in a
more compact chromatin structure, a configuration that is generally associated
with repression of transcription.

Five proteins and/or open reading frames in yeast (RPD3, HDA1, HOS1, HOS2,
and HOS3) that share significant homology in the catalytic domain have been
identified as HDACs based upon their sequence homology to human HDAC1. To
date, eight HDACs have been identified in mammalian cells, and classified into
two classes based on their structure and similarity to yeast RPD3 or HDA1
proteins. Recently, Sir2 family proteins that are structurally unrelated to the
five aforementioned proteins have been identified as NAD-dependent HDACs.
Class I HDACs are the yeast RPD3 homologs HDAC1, 2, 3, and 8, and are composed
primarily of a catalytic domain. Class II HDACs are the yeast HDA1 homologs
HDAC4, 5, 6, and 7. HDAC4, 5, and 7 contain a long non-catalytic N-terminal end
and a C-terminal HDAC catalytic domain, while HDAC6 has two HDAC catalytic
domains.

It has also been determined that histone deacetylases can be sensitive to
small molecules, including trichostatin A (TSA), trapoxin, and butyrate. For
example, the yeast RPD3 and HDA1 and mammalian HDAC1, 2, 3, 4, 5, 6, 7, and 8
are sensitive to inhibition by TSA. The Sir2 family HDACs, yeast HOS3, and
Drosophila melanogaster dHDAC6, however, appear to be relatively insensitive to
TSA. A class of hybrid polar compounds, such as suberoylanilide hydroxamic acid
(SAHA), has also been shown to inhibit histone deacetylases and induce terminal
differentiation and/or apoptosis in various transformed cells. Examples of such
compounds can be found in U.S. Patent Nos. 5,369,108, issued on November 29,
1994, 5,700,811, issued on December 23, 1997, and 5,773,474, issued on June 30,
1998 to Breslow et al., as well as U.S. Patent Nos. 5,055,608, issued on
October 8, 1991, and 5,175,191, issued on December 29, 1992 to Marks et al.,
the entire content of all of which are hereby incorporated by reference.

The identification of the mechanisms by which histones are deacetylated, and
the characterization of histone deacetylase function, would be of great benefit
in understanding how gene transcription is controlled, how the cell cycle is
regulated, and how cells are signaled to undergo terminal differentiation
and/or apoptosis. Elucidation of such mechanisms can lead to improved
therapeutics for many diseases, in particular those characterized by cell
proliferation or a lack of cell differentiation or apoptosis, for example,
cancer.

SUMMARY OF THE INVENTION

The present invention relates to isolated or recombinant histone deacetylase
polypeptides, and isolated histone deacetylase nucleic acid molecules encoding
those polypeptides, as well as vectors and cells containing those isolated
nucleic acid molecules.

In one aspect of the invention, the isolated or recombinant histone
deacetylase polypeptide is selected from a) an isolated or recombinant
polypeptide comprising SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8,
or SEQ ID NO: 10; and b) a polypeptide having at least 60% sequence identity
with any one of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ
ID NO: 10. In one embodiment, the isolated or recombinant histone deacetylase
polypeptide consists of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8,
or SEQ ID NO: 10. In another embodiment, the isolated or recombinant histone
deacetylase polypeptide is mammalian; preferably, the isolated or recombinant
histone deacetylase polypeptide is human.

In another aspect, the invention features an isolated nucleic acid molecule
selected from a) an isolated nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3,
SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9; b) a complement of an isolated
nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7,
or SEQ ID NO: 9; c) an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ
ID NO: 10; d) a complement of an isolated nucleic acid encoding a histone
deacetylase polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID
NO: 8, or SEQ ID NO: 10; e) a nucleic acid that is hybridizable under high
stringency conditions to a nucleic acid molecule that encodes any of SEQ ID
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, or SEQ ID NO: 8, or a complement thereof;
f) a nucleic acid molecule that is hybridizable under high stringency
conditions to a nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID
NO: 5, or SEQ ID NO: 7; and g) an isolated nucleic acid molecule that has at
least 55% sequence identity with any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID
NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, or a complement thereof. In one embodiment,
the isolated nucleic acid molecule consists of SEQ ID NO: 1, SEQ ID NO: 3, SEQ
ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9. In another embodiment, the isolated
nucleic acid molecule is mammalian; preferably, the isolated nucleic acid
molecule is human.

In other aspects, the invention features a vector comprising the isolated 
histone deacetylase nucleic acid molecule described above, a cell comprising the 
vector, and a cell comprising the isolated histone deacetylase nucleic acid molecule 
described above. 

In another aspect, the invention features a purified antibody that selectively
binds a histone deacetylase polypeptide described above.

In yet another aspect, the invention features a method of identifying a
compound that modulates expression of a histone deacetylase nucleic acid
molecule described above. The method comprises the steps of a) contacting the
nucleic acid molecule with a candidate compound under conditions suitable for
expression; and b) assessing the level of expression of the nucleic acid
molecule. A candidate compound that increases or decreases expression of the
nucleic acid molecule relative to a control is a compound that modulates
expression of the nucleic acid molecule. In one embodiment, the method is
carried out in a cell or animal. In another embodiment, the method is carried
out in a cell-free system.

The invention also features a method of treating a cell proliferation disease,
an apoptotic disease, or a cell differentiation disease, for example, cancers
such as lymphoma, leukemia, melanoma, ovarian cancer, breast cancer, pancreatic
cancer, prostate cancer, colon cancer, and lung cancer, and myeloproliferative
disorders, including polycythemia vera, essential thrombocythemia, agnogenic
myeloid metaplasia, and chronic myelogenous leukemia, in an individual,
comprising administering a compound identified by the above method.

In still another aspect, the invention features a method of identifying a
compound that modulates the enzymatic activity of the histone deacetylase
polypeptide described above. The method comprises the steps of a) contacting
the polypeptide with a candidate compound under conditions suitable for
enzymatic reaction; and b) assessing the activity level of the polypeptide. A
candidate compound that increases or decreases the activity level of the
polypeptide relative to a control is a compound that modulates the enzymatic
activity of the polypeptide. In one embodiment, the method is carried out in a
cell or animal. In another embodiment, the method is carried out in a cell-free
system.

In yet another embodiment, the polypeptide is further contacted with a
substrate for the polypeptide, wherein the substrate is selected from the group
consisting of a cell proliferation disease binding agent, an apoptotic disease
binding agent, and a cell differentiation disease binding agent. In one
embodiment, the candidate compound is an inhibitor. In another embodiment, the
candidate compound is an activator.

In another aspect, the invention features a method of identifying a compound
that modulates the transcriptional repression activity of the histone
deacetylase polypeptide described above. The method comprises the steps of a)
contacting the polypeptide with a candidate compound under conditions suitable
for a transcriptional repression reaction; and b) assessing the transcriptional
repression activity level of the polypeptide. A candidate compound that
increases or decreases the transcriptional repression activity level of the
polypeptide relative to a control is a compound that modulates the
transcriptional repression activity of the polypeptide. In one embodiment, the
method is carried out in a cell or animal. In another embodiment, the method is
carried out in a cell-free system.

In yet another embodiment, the polypeptide is further contacted with a
substrate for the polypeptide, wherein the substrate is selected from the group
consisting of a cell proliferation disease binding agent, an apoptotic disease
binding agent, and a cell differentiation disease binding agent. In one
embodiment, the candidate compound is an inhibitor. In another embodiment, the
candidate compound is an activator.
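
By way of illustration only, the comparison of a candidate compound against a control in the screening methods above might be sketched in Python as follows. The function name, the replicate readings, and the 1.5-fold cutoff are hypothetical and are not taken from this specification, which requires only an increase or decrease relative to a control.

```python
from statistics import mean


def classify_modulator(treated, control, min_fold_change=1.5):
    """Label a candidate compound from replicate activity readings.

    treated / control: lists of activity readings (arbitrary units).
    min_fold_change: hypothetical cutoff, chosen here for illustration only.
    """
    control_mean = mean(control)
    if control_mean <= 0:
        raise ValueError("control activity must be positive")
    ratio = mean(treated) / control_mean
    if ratio >= min_fold_change:
        return "activator"      # activity increased relative to control
    if ratio <= 1 / min_fold_change:
        return "inhibitor"      # activity decreased relative to control
    return "no clear modulation"


if __name__ == "__main__":
    # Hypothetical readings: the compound reduces activity to ~20% of control.
    print(classify_modulator([20, 22, 18], [100, 95, 105]))
```

The same comparison could be applied to expression levels, enzymatic activity, or transcriptional repression readouts; any cutoff used in practice would need to be justified by the variability of the particular assay.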

In another aspect, the invention features a method of identifying a compound
that modulates expression of a histone deacetylase nucleic acid molecule
described above. The method comprises the steps of a) providing a nucleic acid
molecule comprising a promoter region of the histone deacetylase nucleic acid
molecule described above, or part of such a promoter region, operably linked to
a reporter gene; b) contacting the nucleic acid molecule with a candidate
compound; and c) assessing the level of the reporter gene. A candidate compound
that increases or decreases expression of the reporter gene relative to a
control is a compound that modulates expression of the histone deacetylase
nucleic acid molecule described above. In one embodiment, the method is carried
out in a cell.

In still another aspect, the invention features a method of identifying a
polypeptide that interacts with a histone deacetylase polypeptide described
above in a yeast two-hybrid system. The method comprises the steps of a)
providing a first nucleic acid vector comprising a nucleic acid molecule
encoding a DNA binding domain and the histone deacetylase polypeptide described
above; b) providing a second nucleic acid vector comprising a nucleic acid
encoding a transcription activation domain and a nucleic acid encoding a test
polypeptide; c) contacting the first nucleic acid vector with the second
nucleic acid vector in a yeast two-hybrid system; and d) assessing
transcriptional activation in the yeast two-hybrid system. An increase in
transcriptional activation relative to a control indicates that the test
polypeptide is a polypeptide that interacts with the histone deacetylase
polypeptide described above.

The invention also features a pharmaceutical composition comprising a 
histone deacetylase polypeptide described above. 

In addition, the present invention features a method of diagnosing a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease
in a subject. The method comprises the steps of a) obtaining a sample from the
subject; and b) assessing the level of activity or expression of the histone
deacetylase polypeptide described above or the level of the nucleic acid
molecule described above in the sample. If the level is increased relative to a
control, then the subject has an increased likelihood of having a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease,
and if the level is decreased relative to a control, then the subject has a
decreased likelihood of having a cell proliferation disease, an apoptotic
disease, or a cell differentiation disease. In one embodiment, the polypeptide
level is assayed using immunohistochemistry techniques. In another embodiment,
the nucleic acid molecule level is assayed using in situ hybridization
techniques.

Compounds and/or polypeptides identified in the above-described screening 
methods are also part of the present invention. 

DESCRIPTION OF THE FIGURES

FIG. 1 is a schematic representation of the order in which FIGS. 1A-1O
should be viewed.

FIGS. 1A-1C show the cDNA sequence of HDAC9 (SEQ ID NO: 1). The arrows and
numbers in the HDAC9 sequence indicate exons. The boxed portion of the sequence
indicates the HDAC domain.

FIGS. 1D-1G show the cDNA sequence of HDAC9a (SEQ ID NO: 3). The arrows and
numbers in the HDAC9a sequence indicate exons. The boxed portion of the
sequence indicates the HDAC domain.

FIGS. 1H-1I show the cDNA sequence of HDRP(ΔNLS) (SEQ ID NO: 9).

FIGS. 1J-1L show the cDNA sequence of HDAC9(ΔNLS) (SEQ ID NO: 5).

FIGS. 1M-1O show the cDNA sequence of HDAC9a(ΔNLS) (SEQ ID NO: 7).

FIG. 2 is a schematic representation of the order in which FIGS. 2A-2E
should be viewed.

FIG. 2A shows the amino acid sequence of HDAC9 (SEQ ID NO: 2).

FIG. 2B shows the amino acid sequence of HDAC9a (SEQ ID NO: 4).

FIG. 2C shows the amino acid sequence of HDAC9(ΔNLS) (SEQ ID NO: 6).

FIG. 2D shows the amino acid sequence of HDAC9a(ΔNLS) (SEQ ID NO: 8).

FIG. 2E shows the amino acid sequence of HDRP(ΔNLS) (SEQ ID NO: 10).



FIG. 3 is a schematic representation of the order in which FIGS. 3A-3C 
should be viewed. 

FIGS. 3A-3C show an amino acid sequence alignment of HDRP (SEQ ID NO: 11),
HDAC9 (SEQ ID NO: 2), HDAC9a (SEQ ID NO: 4), and HDAC4 (SEQ ID NO: 12)
polypeptides. Amino acid sequences of HDAC9 (GenBank Accession: AY032737; SEQ
ID NO: 2) and HDAC9a (GenBank Accession: AY032738; SEQ ID NO: 4) are aligned
with HDRP (GenBank Accession: BAA34464; SEQ ID NO: 11) and HDAC4 (GenBank
Accession: NP_006028; SEQ ID NO: 12). The identical residues in all proteins
are boxed with solid lines. The similar residues are boxed with dotted lines.

FIG. 4 shows a schematic representation of the human HDAC9 gene structure.
The striped boxes represent exons present in isoforms HDRP, HDAC9a, and HDAC9.
The lines represent introns. Broken lines are used for larger introns (with
size in base pairs on top). The 5' untranslated region cDNA and coding region
cDNA are represented here. Exons 1-12 encode a non-catalytic domain of the
polypeptides, and exons 14-21 encode the histone deacetylase catalytic domain
of the polypeptides, which provides the polypeptides with deacetylase activity.

FIG. 5 is a schematic representation of the order in which FIGS. 5A-5D 
should be viewed. 

FIGS. 5A-5D show the nucleic acid sequence of HDAC9, containing all exons
expressed in the various isoforms of HDAC9, HDAC9a, HDAC9(ΔNLS),
HDAC9a(ΔNLS), and HDRP(ΔNLS) of the present invention (SEQ ID NO: 13).

FIG. 6A is a scanned image of a multiple human tissue Northern blot that was
probed to determine mRNA expression of HDAC9 using a cDNA probe that recognizes
both HDAC9 and HDAC9a. The tissues examined are lane 1, heart; lane 2, brain;
lane 3, placenta; lane 4, lung; lane 5, liver; lane 6, skeletal muscle; lane 7,
kidney; and lane 8, pancreas. Positions of the RNA size marker in kilobases
(kb) are indicated to the left of the blot.

FIG. 6B is a scanned image of an electrophoretic gel showing the results of
RT-PCR analyses of mRNA from the same tissues as examined in the Northern blot
of FIG. 6A to determine the distribution of HDAC9 and HDAC9a mRNA among these
tissues. PCR products were resolved by agarose gel electrophoresis and
visualized by ethidium bromide under UV light. A 1-kb DNA ladder was run on
both sides of the gel with the size (in kb) indicated on the left. On the right
side, the expected products for HDAC9 and HDAC9a are indicated as 9 and 9a,
respectively.

FIG. 7 is a graph of HDAC enzymatic activity of HDAC anti-FLAG-
immunoprecipitated proteins isolated from vector control, HDAC9-FLAG, and
HDAC9a-FLAG transfected 293T cells, as measured in fluorescence units using
FLUOR DE LYS™ as a substrate in the presence or absence of 1 μM TSA. Results
are shown as the mean of three independent assays. The inset is a scanned image
of an anti-FLAG Western blot showing the amount of proteins used in the assay.
V, Vector control; 9, HDAC9-FLAG; and 9a, HDAC9a-FLAG.

FIG. 8 is a graph of HDAC enzymatic activity of HDAC anti-FLAG-
immunoprecipitated proteins isolated from vector control, and HDAC9a-FLAG
(treated with 2 μM SAHA or left untreated) transfected 293T cells, as measured
by ³H-acetic acid released from ³H-histones in the presence or absence of 2 μM
SAHA. Vector control; HDAC9a, HDAC9a-FLAG; and HDAC9a+, HDAC9a-FLAG + SAHA.



FIG. 9A shows a scanned image of a Western blot of 293T whole cell lysate and
anti-FLAG immunoprecipitates from 293T cells transfected with vector,
HDAC9-FLAG, or HDAC9a-FLAG. Top panel, anti-MEF2 Western; bottom panel,
anti-FLAG Western. L, 293T whole cell lysate; V, vector control IP; 9,
HDAC9-FLAG IP; 9a, HDAC9a-FLAG IP.

FIG. 9B is a graph showing the transcription level of p3XMEF2-luc in the
presence or absence of pcDNA3 empty vector (-), pCMV-MEF2C, and/or a vector
encoding pFLAG-HDAC9 or pFLAG-HDAC9a. p3XMEF2-luc (100 ng) and pRL-TK (5 ng)
were transfected into 293T cells with pcDNA3 empty vector (-) or with
pCMV-MEF2C (100 ng) (+) along with the indicated amount of pFLAG-HDAC9 or
pFLAG-HDAC9a. pFLAG empty vector was used to adjust the DNA to an equal amount
in each transfection. The firefly luciferase activity was first normalized to
the co-transfected Renilla luciferase activity and the value for MEF2C alone
was then set as 1. Results are shown as the mean of three independent
transfections +/- standard deviation.
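
The two-step normalization described for FIG. 9B (firefly signal divided by the co-transfected Renilla signal, then scaled so that MEF2C alone equals 1) might be sketched in Python as follows. This is an illustrative sketch only; the function names, condition labels, and readings are hypothetical and do not represent data from this specification.

```python
from statistics import mean, stdev


def relative_luciferase(firefly, renilla):
    """Divide each firefly reading by its paired Renilla reading."""
    if len(firefly) != len(renilla):
        raise ValueError("firefly and Renilla readings must be paired")
    return [f / r for f, r in zip(firefly, renilla)]


def normalize_to_reference(conditions, reference):
    """Return {condition: (mean, standard deviation)} scaled to the reference.

    conditions: {name: (firefly readings, Renilla readings)}, one pair of
    lists per condition, one entry per independent transfection.
    reference: name of the condition whose mean is set to 1.
    """
    ratios = {name: relative_luciferase(f, r)
              for name, (f, r) in conditions.items()}
    reference_mean = mean(ratios[reference])
    summary = {}
    for name, values in ratios.items():
        scaled = [v / reference_mean for v in values]
        summary[name] = (mean(scaled), stdev(scaled))
    return summary


if __name__ == "__main__":
    # Hypothetical readings from three independent transfections.
    data = {
        "MEF2C": ([1000, 1200, 800], [10, 10, 10]),
        "MEF2C + HDAC9": ([250, 300, 200], [10, 10, 10]),
    }
    for name, (m, sd) in normalize_to_reference(data, "MEF2C").items():
        print(f"{name}: {m:.2f} +/- {sd:.2f}")
```

Dividing by the Renilla signal corrects for differences in transfection efficiency between wells, so that the remaining difference between conditions reflects reporter activity.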

FIG. 10 shows a schematic representation of the HDAC domains of human
non-Sir2 family HDACs and HDRP. The boxes represent histone deacetylase (HDAC)
domains.

FIG. 11 is a schematic representation of the order in which FIGS. 11A-11F
should be viewed.

FIGS. 11A-11F show the nucleotide sequence of the vector pFLAG-CMV-5b-HDAC9
(VR1) (SEQ ID NO: 14). Lowercase letters are vector backbone, uppercase letters
are HDAC9 sequence. "Acc" was added at the beginning of the HDAC9 sequence for
translation initiation.

FIG. 12 is a schematic representation of the order in which FIGS. 12-1 
through 12-66 should be viewed. 

FIGS. 12-1 through 12-66 show the nucleotide sequence of the vector
pFLAG-CMV-5b-HDAC9a (VR2), with restriction enzyme sites indicated (SEQ ID
NO: 14).

FIG. 13 is a schematic representation of the order in which FIGS. 13A-13E
should be viewed.

FIGS. 13A-13E show the nucleotide sequence of the vector pFLAG-CMV-5b-HDAC9a
(VR2) (SEQ ID NO: 15). Lowercase letters are vector backbone, uppercase letters
are HDAC9a sequence. "Acc" was added at the beginning of the HDAC9a sequence
for translation initiation.

FIG. 14 is a schematic representation of the order in which FIGS. 14-1 
through 14-61 should be viewed. 

FIGS. 14-1 through 14-61 show the nucleotide sequence of the vector
pFLAG-CMV-5b-HDAC9a (VR2), with restriction enzyme sites indicated (SEQ ID
NO: 15).

DETAILED DESCRIPTION OF THE INVENTION 

A protein designated HDRP (see Zhou et al., Proc. Natl. Acad. Sci. USA,
97:1056-1061 (2000)), also called MITR (see Sparrow et al., EMBO J.
18:5085-5098 (1999); Zhang et al., J. Biol. Chem., 276:35-39 (2001); and Zhang
et al., Proc. Natl. Acad. Sci. USA, 98:7354-7359 (2001)), that is 50% identical
to the N-terminal domains of histone deacetylase 4 (HDAC4) and histone
deacetylase 5 (HDAC5) was recently identified. The cloning and characterization
of a novel histone deacetylase, HDAC9, of which HDRP is an alternatively
spliced isoform, is described herein. The cDNA sequence of HDAC9 is shown in
FIGS. 1A-1C (SEQ ID NO: 1), and the HDAC9 amino acid sequence is shown in FIG.
2A (SEQ ID NO: 2). In addition to HDAC9, other alternatively spliced isoforms
were also identified: HDAC9a (a polypeptide that is 132 amino acids shorter at
the C-terminal end than HDAC9), and isoforms of the HDAC9, HDAC9a, and HDRP
polypeptides that lack the nuclear localization signal (NLS) in the N-terminal
non-catalytic end of HDAC9, termed HDAC9(ΔNLS), HDAC9a(ΔNLS), and HDRP(ΔNLS),
respectively. The cDNA sequence of HDAC9a is shown in FIGS. 1D-1G (SEQ ID
NO: 3), and the HDAC9a amino acid sequence is shown in FIG. 2B (SEQ ID NO: 4).
The cDNA sequence of HDAC9 encoding a polypeptide lacking an NLS
(HDAC9(ΔNLS)) is shown in FIGS. 1J-1L (SEQ ID NO: 5), and the amino acid
sequence of HDAC9 lacking an NLS is shown in FIG. 2C (SEQ ID NO: 6). The cDNA
sequence of HDAC9a encoding a polypeptide lacking an NLS (HDAC9a(ΔNLS)) is
shown in FIGS. 1M-1O (SEQ ID NO: 7), and the amino acid sequence of HDAC9a
lacking an NLS is shown in FIG. 2D (SEQ ID NO: 8). The cDNA sequence of HDRP
encoding a polypeptide lacking an NLS (HDRP(ΔNLS)) is shown in FIGS. 1H-1I
(SEQ ID NO: 9), and the amino acid sequence of HDRP lacking an NLS is shown in
FIG. 2E (SEQ ID NO: 10).

POLYPEPTIDES OF THE INVENTION

The present invention features isolated or recombinant HDAC9 polypeptides,
HDAC9a polypeptides, HDAC9(ΔNLS) polypeptides, HDAC9a(ΔNLS) polypeptides, and
HDRP(ΔNLS) polypeptides, and fragments, derivatives, and variants thereof, as
well as polypeptides encoded by nucleotide sequences described herein (e.g.,
other variants). As used herein, the term "polypeptide" refers to a polymer of
amino acids, and not to a specific length; thus, peptides, oligopeptides, and
proteins are included within the definition of a polypeptide.

As used herein, a polypeptide is said to be "isolated," "substantially pure,"
or "substantially pure and isolated" when it is substantially free of cellular
material, when it is isolated from recombinant or non-recombinant cells, or
free of chemical precursors or other chemicals when it is chemically
synthesized. Typically, the HDAC9, HDAC9a, HDAC9(ΔNLS), HDAC9a(ΔNLS), or
HDRP(ΔNLS) polypeptide is isolated, substantially pure, or substantially pure
and isolated when it has a relative increased concentration or activity of
HDAC9, HDAC9a, HDAC9(ΔNLS), HDAC9a(ΔNLS), or HDRP(ΔNLS), in comparison to
total HDAC concentration or activity. Preferably the increased activity or
concentration of the HDAC9, HDAC9a, HDAC9(ΔNLS), HDAC9a(ΔNLS), or HDRP(ΔNLS)
is at least 2-fold, more preferably, at least 5-fold, and most preferably, at
least 10-fold, in comparison to total HDAC concentration or activity. In
addition, a polypeptide can be joined to another polypeptide with which it is
not normally associated in a cell (e.g., in a "fusion protein") and still be
"isolated," "substantially pure," or "substantially pure and isolated." An
isolated, substantially pure, or substantially pure and isolated polypeptide
may be obtained, for example, using affinity purification techniques described
herein, as well as other techniques described herein and known to those
skilled in the art.
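
One possible reading of the 2-fold, 5-fold, and 10-fold thresholds above is sketched below in Python. The sketch assumes that enrichment is computed as the fraction of total HDAC activity (or concentration) attributable to the polypeptide of interest in the preparation, divided by the same fraction in the starting material; that formula, the function names, and the example values are assumptions made for illustration and are not defined in this specification.

```python
def fold_enrichment(target_prep, total_prep, target_start, total_start):
    """Ratio of (target / total HDAC) in the preparation to the same
    ratio in the starting material. All inputs must be positive and in
    consistent units (activity or concentration)."""
    values = (target_prep, total_prep, target_start, total_start)
    if min(values) <= 0:
        raise ValueError("all measurements must be positive")
    return (target_prep / total_prep) / (target_start / total_start)


def enrichment_tier(fold):
    """Map a fold-enrichment value onto the tiers recited in the text."""
    for threshold, label in ((10, "at least 10-fold"),
                             (5, "at least 5-fold"),
                             (2, "at least 2-fold")):
        if fold >= threshold:
            return label
    return "below 2-fold"


if __name__ == "__main__":
    # Hypothetical values: target is 1/16 of total HDAC before purification
    # and 1/2 of total HDAC afterwards.
    fold = fold_enrichment(4, 8, 1, 16)
    print(fold, enrichment_tier(fold))
```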

By a "histone deacetylase polypeptide" is meant a polypeptide having histone
deacetylase activity, transcription repression activity, and/or the ability to
deacetylate other substrates, for example, transcription factors, including
p53, CoRest, E2F, GATA-1, TFIIE, and TFIIF, that normally have a nuclear or
cytoplasmic location in a cell. A histone deacetylase polypeptide is also a
polypeptide whose activity can be inhibited by molecules having HDAC inhibitory
activity. These molecules fall into four general classes: 1) short-chain fatty
acids (e.g., 4-phenylbutyrate and valproic acid); 2) hydroxamic acids (e.g.,
SAHA, Pyroxamide, trichostatin A (TSA), oxamflatin, and CHAPs, such as CHAP1
and CHAP 31); 3) cyclic tetrapeptides (Trapoxin A, Apicidin, and Depsipeptide
(FK-228, also known as FR901228)); and 4) benzamides (e.g., MS-275); as well as
other compounds such as Scriptaid. Examples of such compounds can be found in
U.S. Patent Nos. 5,369,108, issued on November 29, 1994, 5,700,811, issued on
December 23, 1997, and 5,773,474, issued on June 30, 1998 to Breslow et al.;
U.S. Patent Nos. 5,055,608, issued on October 8, 1991, and 5,175,191, issued on
December 29, 1992 to Marks et al.; as well as Yoshida et al., Bioessays 17,
423-430 (1995); Saito et al., PNAS USA 96, 4592-4597 (1999); Furumai et al.,
PNAS USA 98(1), 87-92 (2001); Komatsu et al., Cancer Res. 61(11), 4459-4466
(2001); Su et al., Cancer Res. 60, 3137-3142 (2000); Lee et al., Cancer Res.
61(3), 931-934; and Suzuki et al., J. Med. Chem. 42(15), 3001-3003 (1999), the
entire content of all of which are hereby incorporated by reference. Examples
of such histone deacetylase polypeptides include HDAC9, HDAC9a, HDAC9(ΔNLS),
HDAC9a(ΔNLS), and HDRP(ΔNLS); a substantially pure polypeptide comprising SEQ
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; and a
polypeptide having preferably at least 60%, more preferably, 70%, 75%, 80%,
85%, or 90%, and most preferably, 95% sequence identity to any one of SEQ ID
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, as
determined using the BLAST program and parameters described herein.
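
For illustration, the sequence identity tiers recited above (60%, 70%, 75%, 80%, 85%, 90%, and 95%) might be checked as in the following Python sketch. The sketch does not run BLAST; it assumes that a gapped pairwise alignment has already been produced (for example, by BLAST) and simply counts identical columns over the alignment length, which is one common convention and an assumption here. The function names and the short example sequences are hypothetical.

```python
IDENTITY_TIERS = (95, 90, 85, 80, 75, 70, 60)  # percentages recited in the text


def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over the columns of a pre-computed pairwise
    alignment. Columns where both sequences have a gap are ignored."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("alignment has no comparable columns")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


def highest_tier_met(identity):
    """Return the highest recited threshold satisfied, or None."""
    for tier in IDENTITY_TIERS:
        if identity >= tier:
            return tier
    return None


if __name__ == "__main__":
    # Hypothetical ten-column alignment with one gap and one mismatch.
    identity = percent_identity("MKTA-IAKQR", "MKTAYIAKHR")
    print(identity, highest_tier_met(identity))
```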

In one embodiment, the histone deacetylase polypeptide has histone
deacetylase activity, transcription repression activity, the ability to
deacetylate substrates, or is inhibited by trichostatin A or a hybrid polar
compound such as SAHA. In another embodiment, the HDAC9(ΔNLS) polypeptide has
any two of the above biological activities. In still another embodiment, the
HDAC9(ΔNLS) polypeptide has any three of the above biological activities. In
yet another embodiment, the HDAC9(ΔNLS) polypeptide has all of the above
biological activities.

An HDAC9 polypeptide is a histone deacetylase polypeptide as described
above. An HDAC9 polypeptide preferably has at least 60%, more preferably, 70%,
75%, 80%, 85%, or 90%, and most preferably, 95% sequence identity to SEQ ID
NO: 2, as determined using the BLAST program and parameters described herein.
An HDAC9 polypeptide is also a polypeptide that comprises the amino acids
encoded by exons 23, 24, 25, and/or 26, and that does not comprise the amino
acids encoded by exon 13 of the HDAC9 nucleic acid sequence, as shown in FIGS.
1A-1C, FIG. 4, and FIGS. 5A-5D. Preferably, an HDAC9 polypeptide comprises the
sequence of SEQ ID NO: 2. More preferably, an HDAC9 polypeptide consists of the
sequence of SEQ ID NO: 2. An HDAC9 polypeptide is also a polypeptide comprising
the amino acid sequence of the polypeptide encoded by the nucleic acid sequence
of SEQ ID NO: 1.

An HDAC9a polypeptide is a histone deacetylase polypeptide as described
above. An HDAC9a polypeptide preferably has at least 60%, more preferably, 70%,
75%, 80%, 85%, or 90%, and most preferably, 95% sequence identity to SEQ ID
NO: 4, as determined using the BLAST program and parameters described herein.
An HDAC9a polypeptide is also a polypeptide that comprises the amino acids
encoded by exon 22, and that does not comprise the amino acids encoded by exons
13, 23, 24, 25, or 26 of the HDAC9 nucleic acid sequence, as shown in FIGS.
1D-1G, FIG. 4, and FIGS. 5A-5D. Preferably, an HDAC9a polypeptide comprises the
sequence of SEQ ID NO: 4. More preferably, an HDAC9a polypeptide consists of
the sequence of SEQ ID NO: 4. An HDAC9a polypeptide is also a polypeptide
comprising the amino acid sequence of the polypeptide encoded by the nucleic
acid sequence of SEQ ID NO: 3.

An HDAC9(ΔNLS) polypeptide is a histone deacetylase polypeptide as described
above. An HDAC9(ΔNLS) polypeptide does not comprise a nuclear localization
signal (NLS). An HDAC9(ΔNLS) polypeptide preferably has at least 60%, more
preferably, 70%, 75%, 80%, 85%, or 90%, and most preferably, 95% sequence
identity to SEQ ID NO: 6, as determined using the BLAST program and parameters
described herein. An HDAC9(ΔNLS) polypeptide is also a polypeptide that
comprises the amino acids encoded by exons 23, 24, 25, and/or 26, and that does
not comprise the amino acids encoded by exons 7 or 13 of the HDAC9 nucleic acid
sequence, as shown in FIGS. 1J-1L, and FIGS. 5A-5D. Preferably, an HDAC9(ΔNLS)
polypeptide comprises the sequence of SEQ ID NO: 6. More preferably, an
HDAC9(ΔNLS) polypeptide consists of the sequence of SEQ ID NO: 6. An
HDAC9(ΔNLS) polypeptide is also a polypeptide comprising the amino acid
sequence of the polypeptide encoded by the nucleic acid sequence of SEQ ID
NO: 5.

An HDAC9a(ΔNLS) polypeptide is a histone deacetylase polypeptide as
described above. An HDAC9a(ΔNLS) polypeptide does not comprise a nuclear
localization signal (NLS). An HDAC9a(ΔNLS) polypeptide preferably has at least
60%, more preferably, 70%, 75%, 80%, 85%, or 90%, and most preferably, 95%
sequence identity to SEQ ID NO: 8, as determined using the BLAST program and
parameters described herein. An HDAC9a(ΔNLS) polypeptide is also a polypeptide
that comprises the amino acids encoded by exon 22, and that does not comprise
the amino acids encoded by exons 7, 13, 23, 24, 25, or 26 of the HDAC9 nucleic
acid sequence, as shown in FIGS. 1M-1O, and FIGS. 5A-5D. Preferably, an
HDAC9a(ΔNLS) polypeptide comprises the sequence of SEQ ID NO: 8. More
preferably, an HDAC9a(ΔNLS) polypeptide consists of the sequence of SEQ ID
NO: 8. An HDAC9a(ΔNLS) polypeptide is also a polypeptide comprising the amino
acid sequence of the polypeptide encoded by the nucleic acid sequence of SEQ ID
NO: 7.

An HDRP(ΔNLS) polypeptide is a histone deacetylase polypeptide as described
above. An HDRP(ΔNLS) polypeptide does not comprise a nuclear localization
signal (NLS). An HDRP(ΔNLS) polypeptide preferably has at least 60%, more
preferably, 70%, 75%, 80%, 85%, or 90%, and most preferably, 95% sequence
identity to SEQ ID NO: 10, as determined using the BLAST program and parameters
described herein. An HDRP(ΔNLS) polypeptide is also a polypeptide that does not
comprise the amino acids encoded by exons 7 or 13-26 of the HDAC9 nucleic acid
sequence, as shown in FIGS. 1H-1I and FIGS. 5A-5D. Preferably, an HDRP(ΔNLS)
polypeptide comprises the sequence of SEQ ID NO: 10. More preferably, an
HDRP(ΔNLS) polypeptide consists of the sequence of SEQ ID NO: 10. An
HDRP(ΔNLS) polypeptide is also a polypeptide comprising the amino acid sequence
of the polypeptide encoded by the nucleic acid sequence of SEQ ID NO: 9.

The polypeptides of the invention can be purified to homogeneity. It is 
understood, however, that preparations in which the polypeptide is not purified to
homogeneity are useful. The critical feature is that the preparation allows for the
desired function of the polypeptide, even in the presence of considerable amounts of
other components. Thus, the invention encompasses various degrees of purity. In
one embodiment, the language "substantially free of cellular material" includes
preparations of the polypeptide having less than about 30% (by dry weight) other
proteins (i.e., contaminating protein), less than about 20% other proteins, less than
about 10% other proteins, or less than about 5% other proteins. 

When a polypeptide is recombinantly produced, it can also be substantially 
free of culture medium, i.e., culture medium represents less than about 20%, less 
than about 10%, or less than about 5% of the volume of the polypeptide preparation.
The language "substantially free of chemical precursors or other chemicals" includes 
preparations of the polypeptide in which it is separated from chemical precursors or 
other chemicals that are involved in its synthesis. In one embodiment, the language 
"substantially free of chemical precursors or other chemicals" includes preparations 
of the polypeptide having less than about 30% (by dry weight) chemical precursors
or other chemicals, less than about 20% chemical precursors or other chemicals, less 
than about 10% chemical precursors or other chemicals, or less than about 5% 
chemical precursors or other chemicals. 

In one embodiment, a polypeptide of the invention comprises an amino acid 
sequence encoded by a nucleic acid molecule comprising a nucleotide sequence
selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5,
SEQ ID NO: 7, SEQ ID NO: 9, and complements and portions thereof (e.g., a
complement of any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID
NO: 7, SEQ ID NO: 9, or a portion of any one of SEQ ID NO: 1, SEQ ID NO: 3,
SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9).

The polypeptides of the invention also encompass fragments and sequence 
variants. Variants include a substantially homologous polypeptide encoded by the
same genetic locus in an organism, i.e., an allelic variant, as well as other variants.
Variants also encompass polypeptides derived from other genetic loci in an
organism, but having substantial homology to a polypeptide encoded by a nucleic
acid molecule comprising a nucleotide sequence selected from the group consisting
of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9,
and complements and portions thereof, or having substantial homology to a 
polypeptide encoded by a nucleic acid molecule comprising a nucleotide sequence 
selected from the group consisting of nucleotide sequences encoding any one of SEQ 
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10. 

Variants also include polypeptides substantially homologous or identical to these
polypeptides but derived from another organism, i.e., an ortholog. Variants also
include polypeptides that are substantially homologous or identical to these
polypeptides that are produced by chemical synthesis. Variants also include
polypeptides that are substantially homologous or identical to these polypeptides that
are produced by recombinant methods.

As used herein, two polypeptides (or a region of the polypeptides) are 
substantially homologous or identical when the amino acid sequences are at least 
about 60-65%, typically at least about 70-75%, more typically at least about 80-85%, 
and most typically greater than about 90-95% or more homologous or identical. A
substantially identical or homologous amino acid sequence, according to the present
invention, will be encoded by a nucleic acid molecule hybridizing to SEQ ID NO: 1,
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, or a portion thereof,
under stringent conditions as more particularly described herein, or will be encoded
by a nucleic acid molecule hybridizing to a nucleic acid sequence encoding SEQ ID
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, or a portion
thereof, under stringent conditions as more particularly described herein. 

The percent identity of two nucleotide or amino acid sequences can be 
determined by aligning the sequences for optimal comparison purposes (e.g., gaps 
can be introduced in the sequence of a first sequence). The nucleotides or amino
acids at corresponding positions are then compared, and the percent identity between
the two sequences is a function of the number of identical positions shared by the
sequences (i.e., % identity = (# of identical positions / total # of positions) x 100). In
certain embodiments, the length of the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), and HDRP(ANLS) amino acid or nucleotide sequence aligned for 
comparison purposes is at least 30%, preferably, at least 40%, more preferably, at 
least 60%, and even more preferably, at least 70%, 80%, 90%, or 100% of the length 
of the reference sequence, for example, those sequences provided in FIGS. 1A-1O
and 2A-2E. The actual comparison of the two sequences can be accomplished by
well-known methods, for example, using a mathematical algorithm. A preferred,
non-limiting example of such a mathematical algorithm is described in Karlin et al.,
Proc. Natl. Acad. Sci. USA, 90:5873-5877 (1993). Such an algorithm is
incorporated into the BLASTN and BLASTX programs (version 2.2) as described in
Schaffer et al., Nucleic Acids Res., 29:2994-3005 (2001). When utilizing BLAST
and Gapped BLAST programs, the default parameters of the respective programs
(e.g., BLASTN) can be used. See http://www.ncbi.nlm.nih.gov, as available on
August 10, 2001. In one embodiment, the database searched is a non-redundant
(NR) database, and parameters for sequence comparison can be set at: no filters;
Expect value of 10; Word Size of 3; the Matrix is BLOSUM62; and Gap Costs have
an Existence of 11 and an Extension of 1.
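
By way of illustration only, the percent identity calculation and the aligned-length criterion described above might be sketched as follows. The sketch assumes the two sequences have already been aligned by a separate program and are supplied as equal-length strings with "-" marking gaps; the function names, the gap symbol, and the choice to count every alignment column as a position are assumptions made for this example and are not taken from the BLAST or GCG programs.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Return (# identical positions / total # positions) x 100.

    Assumption: "total # of positions" is the number of alignment columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    # A column counts as identical only when both residues match and neither is a gap.
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * identical / len(aligned_a)


def aligned_fraction(aligned_query: str, aligned_reference: str, gap: str = "-") -> float:
    """Return the fraction of the reference length that is paired with query residues."""
    reference_length = sum(1 for r in aligned_reference if r != gap)
    if reference_length == 0:
        raise ValueError("reference sequence is empty")
    paired = sum(
        1
        for q, r in zip(aligned_query, aligned_reference)
        if q != gap and r != gap
    )
    return paired / reference_length


if __name__ == "__main__":
    # Hypothetical toy alignment, not taken from any SEQ ID NO.
    query, reference = "MKV-LSA", "MKVQLTA"
    print(round(percent_identity(query, reference), 1))
    print(round(aligned_fraction(query, reference), 3))
```

In this sketch a result from `aligned_fraction` of 0.30 or more would correspond to the "at least 30%" aligned-length embodiment, and the value from `percent_identity` would be compared against the identity thresholds recited herein.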

Another preferred, non-limiting example of a mathematical algorithm 
utilized for the comparison of sequences is the algorithm of Myers and Miller, 
CABIOS (1989). Such an algorithm is incorporated into the ALIGN program
(version 2.0), which is part of the GCG (Accelrys) sequence alignment software 
package. When utilizing the ALIGN program for comparing amino acid sequences, 
a PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4
can be used. Additional algorithms for sequence analysis are known in the art and
include ADVANCE and ADAM as described in Torellis and Robotti, Comput.
Appl. Biosci., 10: 3-5 (1994); and FASTA described in Pearson and Lipman, Proc. 
Natl. Acad. Sci USA 85: 2444-8 (1988). 

In another embodiment, the percent identity between two amino acid 
sequences can be accomplished using the GAP program in the GCG software 
package (available at http://www.accelrys.com, as available on August 31, 2001)
using either a Blossom 63 matrix or a PAM250 matrix, and a gap weight of 12, 10,
8, 6, or 4 and a length weight of 2, 3, or 4. In yet another embodiment, the percent
identity between two nucleic acid sequences can be accomplished using the GAP
program in the GCG software package (available at http://www.cgc.com), using a 
gap weight of 50 and a length weight of 3. 

The invention also encompasses HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), and HDRP(ANLS) polypeptides having a lower degree of identity
but having sufficient similarity so as to perform one or more of the same functions
performed by an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide encoded by a nucleic acid molecule of the invention.
Similarity is determined by conserved amino acid substitution. Such substitutions
are those that substitute a given amino acid in a polypeptide by another amino acid
of like characteristics. Conservative substitutions are likely to be phenotypically
silent. Typically seen as conservative substitutions are the replacements, one for
another, among the aliphatic amino acids Ala, Val, Leu, and Ile; interchange of the
hydroxyl residues Ser and Thr; exchange of the acidic residues Asp and Glu;
substitution between the amide residues Asn and Gln; exchange of the basic residues
Lys and Arg; and replacements among the aromatic residues Phe and Tyr. Guidance
concerning which amino acid changes are likely to be phenotypically silent is found
in Bowie et al., Science 247: 1306-1310 (1990).
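
As a non-limiting illustration, the conservative substitution groups listed above can be encoded as a lookup and used to label the differences between a reference sequence and a variant of equal length. The group names, function names, and example peptides below are hypothetical and introduced only for this sketch; the grouping is restricted to the residues named in the preceding paragraph (one-letter codes), so a residue such as Trp is treated as outside every group.

```python
# One-letter codes for the residue groups named in the text above.
CONSERVATIVE_GROUPS = {
    "aliphatic": {"A", "V", "L", "I"},  # Ala, Val, Leu, Ile
    "hydroxyl": {"S", "T"},             # Ser, Thr
    "acidic": {"D", "E"},               # Asp, Glu
    "amide": {"N", "Q"},                # Asn, Gln
    "basic": {"K", "R"},                # Lys, Arg
    "aromatic": {"F", "Y"},             # Phe, Tyr
}


def is_conservative(original: str, replacement: str) -> bool:
    """True when two different residues fall in the same listed group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS.values())


def classify_substitutions(reference: str, variant: str):
    """List (1-based position, reference residue, variant residue, label) per difference."""
    if len(reference) != len(variant):
        raise ValueError("this sketch handles substitutions only, not insertions or deletions")
    changes = []
    for position, (ref, var) in enumerate(zip(reference, variant), start=1):
        if ref != var:
            label = "conservative" if is_conservative(ref, var) else "non-conservative"
            changes.append((position, ref, var, label))
    return changes


if __name__ == "__main__":
    # Hypothetical five-residue peptides, not taken from any SEQ ID NO.
    for change in classify_substitutions("MLKDF", "MIKEW"):
        print(change)
```

A label of "conservative" in this sketch indicates only membership in the same listed group; whether a given change is in fact phenotypically silent would still be determined experimentally.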

A variant polypeptide can differ in amino acid sequence by one or more
substitutions, deletions, insertions, inversions, fusions, and truncations or a
combination of any of these. Further, variant polypeptides can be fully functional or
can lack function in one or more activities, for example, in histone deacetylase
activity or transcription repression activity. Fully functional variants typically
contain only conservative variation or variation in non-critical residues or in
non-critical regions. Functional variants can also contain substitution of similar
amino acids that result in no change or an insignificant change in function.
Alternatively, such substitutions may positively or negatively affect function to some
degree. Non-functional variants typically contain one or more non-conservative
amino acid substitutions, deletions, insertions, inversions, or truncations or a
substitution, insertion, inversion, or deletion in a critical residue or critical region.
Such critical regions include the HDAC domains, which provide the polypeptide
with deacetylase activity, as shown in the nucleic acid sequences of FIGS. 1A-1G, as
well as in the schematic of FIG. 4.

Amino acids that are essential for function can be identified by methods 
known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis 
(Cunningham et al., Science, 244: 1081-1085 (1989)). The latter procedure
introduces a single alanine mutation at each of the residues in the molecule (one 
mutation per molecule). The resulting mutant molecules are then tested for 
biological activity in vitro. Sites that are critical for polypeptide activity can also be 
determined by structural analysis, such as crystallization, nuclear magnetic
resonance, or photoaffinity labeling (See Smith et al., J. Mol. Biol., 224: 899-904
(1992); and de Vos et al., Science, 255: 306-312 (1992)).
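
For illustration, the design step of an alanine-scanning experiment (one alanine mutation per molecule, one mutant per residue) can be sketched in a few lines. The function name and the mutation labels (e.g., "K3A") are assumptions made for this example, as is the choice to skip positions that are already alanine; the sketch only enumerates the mutant sequences and does not model the subsequent in vitro activity testing.

```python
from typing import Iterator, Tuple


def alanine_scan(sequence: str) -> Iterator[Tuple[str, str]]:
    """Yield (label, mutant sequence) pairs, each carrying a single alanine substitution."""
    for index, residue in enumerate(sequence):
        if residue == "A":
            continue  # assumption: positions that are already alanine are skipped
        label = f"{residue}{index + 1}A"  # e.g. K3A = Lys at position 3 replaced by Ala
        yield label, sequence[:index] + "A" + sequence[index + 1:]


if __name__ == "__main__":
    # Hypothetical three-residue peptide, not taken from any SEQ ID NO.
    for label, mutant in alanine_scan("MAK"):
        print(label, mutant)
```

Each mutant produced this way would then be expressed and assayed individually, so that a loss of activity can be attributed to the single substituted residue.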

The invention also includes HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), and HDRP(ANLS) polypeptide fragments of the polypeptides of 
the invention. Fragments can be derived from a polypeptide comprising SEQ ID
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, or from
a polypeptide encoded by a nucleic acid molecule comprising SEQ ID NO: 1, SEQ 
ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9 or a portion thereof and 
the complements thereof or other variants. The present invention also encompasses 
fragments of the variants of the polypeptides described herein. Useful fragments
include those that retain one or more of the biological activities of the polypeptide as
well as fragments that can be used as an immunogen to generate polypeptide-specific 
antibodies. 

Biologically active fragments (peptides that are, for example, 6, 9, 12, 15, 16, 
20, 30, 35, 36, 37, 38, 39, 40, 50, 100, or more amino acids in length) can comprise
a domain, segment, or motif, for example, an HDAC domain, that has been
identified by analysis of the polypeptide sequence using well-known methods, e.g.,
signal peptides, extracellular domains, one or more transmembrane segments or 
loops, ligand binding regions, zinc finger domains, DNA binding domains, acylation 
sites, glycosylation sites, or phosphorylation sites. 

Fragments can be discrete (not fused to other amino acids or polypeptides) or
can be within a larger polypeptide. Further, several fragments can be comprised
within a single larger polypeptide. In one embodiment, a fragment designed for
expression in a host can have heterologous pre- and pro-polypeptide regions fused to
the amino terminus of the polypeptide fragment and an additional region fused to the 
carboxyl terminus of the fragment. 

The invention thus provides chimeric or fusion polypeptides. These 
comprise an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide of the invention operatively linked to a heterologous protein or 
polypeptide having an amino acid sequence not substantially homologous to the 
polypeptide. "Operatively linked" indicates that the polypeptide and the 
heterologous protein are fused in-frame. The heterologous protein can be fused to
the N-terminus or C-terminus of the polypeptide. In one embodiment, the fusion
polypeptide does not affect the function of the polypeptide per se. For example, the 
fusion polypeptide can be a GST-fusion polypeptide in which the polypeptide 
sequences are fused to the C-terminus of the GST sequences. Other types of fusion 
polypeptides include, but are not limited to, enzymatic fusion polypeptides, for
example, β-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His fusions,
and Ig fusions. Such fusion polypeptides, particularly poly-His fusions, can
facilitate the purification of recombinant polypeptide. In certain host cells (e.g.,
mammalian host cells), expression and/or secretion of a polypeptide can be
increased by using a heterologous signal sequence. Therefore, in another
embodiment, the fusion polypeptide contains a heterologous signal sequence at its
N-terminus. 

EP-A 0464 533 discloses fusion proteins comprising various portions of 
immunoglobulin constant regions. The Fc is useful in therapy and diagnosis and 
thus results, for example, in improved pharmacokinetic properties (EP-A 0232 262).
In drug discovery, for example, human proteins have been fused with Fc portions for
the purpose of high-throughput screening assays to identify antagonists. (See
Bennett et al., Journal of Molecular Recognition, 8: 52-58 (1995) and Johanson et
al., The Journal of Biological Chemistry, 270, 16: 9459-9471 (1995)). Thus, this
invention also encompasses soluble fusion polypeptides containing a polypeptide of
the invention and various portions of the constant regions of heavy or light chains of
immunoglobulins of various subclass (IgG, IgM, IgA, IgE).

A chimeric or fusion polypeptide can be produced by standard recombinant
DNA techniques. For example, DNA fragments coding for the different polypeptide 
sequences are ligated together in-frame in accordance with conventional techniques. 
In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PCR
amplification of nucleic acid fragments can be carried out using anchor primers that
give rise to complementary overhangs between two consecutive nucleic acid
fragments that can subsequently be annealed and re-amplified to generate a chimeric
nucleic acid sequence (see Ausubel et al., "Current Protocols in Molecular Biology,"
John Wiley & Sons, (1998), the entire teachings of which are incorporated by
reference herein). Moreover, many expression vectors are commercially available 
that already encode a fusion moiety (e.g., a GST protein). A nucleic acid molecule 
encoding a polypeptide of the invention can be cloned into such an expression vector 
such that the fusion moiety is linked in-frame to the polypeptide. 

The substantially pure, isolated, or substantially pure and isolated HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide can be
purified from cells that naturally express it, purified from cells that have been altered
to express it (recombinant), or synthesized using known protein synthesis methods.
In one embodiment, the polypeptide is produced by recombinant DNA techniques.
For example, a nucleic acid molecule encoding the polypeptide is cloned into an
expression vector, the expression vector introduced into a host cell, and the 
polypeptide expressed in the host cell. The polypeptide can then be isolated from 
the cells by an appropriate purification scheme using standard protein purification 
techniques. 

In general, HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and
HDRP(ANLS) polypeptides of the present invention can be used as a molecular
weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns 
using art-recognized methods. The polypeptides of the present invention can be 
used to raise antibodies or to elicit an immune response. The polypeptides can also
be used as a reagent, e.g., a labeled reagent, in assays to quantitatively determine
levels of the polypeptide or a molecule to which it binds (e.g., a receptor or a ligand)
in biological fluids. The polypeptides can also be used as markers for cells or tissues
in which the corresponding polypeptide is preferentially expressed, either
constitutively, during tissue differentiation, or in a diseased state. The polypeptides 
can be used to isolate a corresponding binding agent, and to screen for peptide or 
small molecule antagonists or agonists of the binding interaction. The polypeptides 
of the present invention can also be used as therapeutic agents.

NUCLEIC ACID MOLECULES OF THE INVENTION 

The present invention also features isolated HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules.

By a "histone deacetylase nucleic acid molecule" is meant a nucleic acid
molecule that encodes a histone deacetylase polypeptide. Such histone nucleic acids
include, for example, the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleic acid molecule described in detail herein; an isolated nucleic
acid comprising SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or
SEQ ID NO: 9; a complement of an isolated nucleic acid comprising SEQ ID NO: 1,
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9; an isolated
nucleic acid encoding a histone deacetylase polypeptide of SEQ ID NO: 2, SEQ ID
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; a complement of an
isolated nucleic acid encoding a histone deacetylase polypeptide of SEQ ID NO: 2,
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; a nucleic acid
that is hybridizeable under high stringency conditions to a nucleic acid molecule that
encodes any of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, or SEQ ID NO: 8, or
a complement thereof; a nucleic acid molecule that is hybridizeable under high
stringency conditions to a nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3,
SEQ ID NO: 5, or SEQ ID NO: 7; and an isolated nucleic acid molecule that has at
least 55%, more preferably, 60%, 65%, 70%, 75%, 80%, 85%, or 90%, and most 
preferably, 95% or 99% sequence identity with any one of SEQ ID NO: 1, SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, or a complement thereof. 

An HDAC9 nucleic acid molecule is a nucleic acid molecule that encodes an
HDAC9 polypeptide. In one embodiment, the HDAC9 nucleic acid molecule is
selected from: a nucleic acid molecule that comprises the nucleic acid sequence of
SEQ ID NO: 1; a complement of an isolated nucleic acid comprising SEQ ID NO: 1;
an isolated nucleic acid encoding a histone deacetylase polypeptide of SEQ ID NO:
2; a complement of an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 2; a nucleic acid that is hybridizeable under high
stringency conditions to a nucleic acid molecule that encodes SEQ ID NO: 2; a
nucleic acid molecule that is hybridizeable under high stringency conditions to a
nucleic acid comprising SEQ ID NO: 1; and an isolated nucleic acid molecule that
has preferably, at least 55%, more preferably, 60%, 65%, 70%, 75%, 80%, 85%, or
90%, and most preferably, 95% or 99% sequence identity with SEQ ID NO: 1, as
determined using the BLAST program and parameters described herein. In another
embodiment, the HDAC9 nucleic acid molecule consists of the nucleic acid
sequence of SEQ ID NO: 1. 

An HDAC9a nucleic acid molecule is a nucleic acid molecule that encodes
an HDAC9a polypeptide. An HDAC9a nucleic acid molecule preferably has at least
55% sequence identity to SEQ ID NO: 3. In one embodiment, the HDAC9a nucleic
acid molecule is selected from: a nucleic acid molecule that comprises the nucleic
acid sequence of SEQ ID NO: 3; a complement of an isolated nucleic acid 
comprising SEQ ID NO: 3; an isolated nucleic acid encoding a histone deacetylase 
polypeptide of SEQ ID NO: 4; a complement of an isolated nucleic acid encoding a 
histone deacetylase polypeptide of SEQ ID NO: 4; a nucleic acid that is 
hybridizeable under high stringency conditions to a nucleic acid molecule that
encodes SEQ ID NO: 4; a nucleic acid molecule that is hybridizeable under high 
stringency conditions to a nucleic acid comprising SEQ ID NO: 3; and an isolated 
nucleic acid molecule that has preferably, at least 55%, more preferably, 60%, 65%, 
70%, 75%, 80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity 
with SEQ ID NO: 3 or a complement thereof, as determined using the BLAST
program and parameters described herein. In another embodiment, the HDAC9a 
nucleic acid molecule consists of the nucleic acid sequence of SEQ ID NO: 3. 

An HDAC9(ANLS) nucleic acid molecule is a nucleic acid molecule that 
encodes an HDAC9(ANLS) polypeptide. In one embodiment, the HDAC9(ANLS) 
nucleic acid molecule is selected from: a nucleic acid molecule that comprises the
nucleic acid sequence of SEQ ID NO: 5; a complement of an isolated nucleic acid
comprising SEQ ID NO: 5; an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 6; a complement of an isolated nucleic acid encoding a
histone deacetylase polypeptide of SEQ ID NO: 6; a nucleic acid that is 
hybridizeable under high stringency conditions to a nucleic acid molecule that 
encodes SEQ ID NO: 6; a nucleic acid molecule that is hybridizeable under high 
stringency conditions to a nucleic acid comprising SEQ ID NO: 5; and an isolated
nucleic acid molecule that has preferably, at least 55%, more preferably, 60%, 65%, 
70%, 75%, 80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity 
with SEQ ID NO: 5 or a complement thereof, as determined using the BLAST 
program and parameters described herein. In another embodiment, the 
HDAC9(ANLS) nucleic acid molecule consists of the nucleic acid sequence of SEQ
ID NO: 5. 

An HDAC9a(ANLS) nucleic acid molecule is a nucleic acid molecule that 
encodes an HDAC9a(ANLS) polypeptide. In one embodiment, the HDAC9a(ANLS) 
nucleic acid molecule is selected from: a nucleic acid molecule that comprises the
nucleic acid sequence of SEQ ID NO: 7; a complement of an isolated nucleic acid
comprising SEQ ID NO: 7; an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 8; a complement of an isolated nucleic acid encoding a
histone deacetylase polypeptide of SEQ ID NO: 8; a nucleic acid that is
hybridizeable under high stringency conditions to a nucleic acid molecule that
encodes SEQ ID NO: 8; a nucleic acid molecule that is hybridizeable under high
stringency conditions to a nucleic acid comprising SEQ ID NO: 7; and an isolated 
nucleic acid molecule that has preferably, at least 55%, more preferably, 60%, 65%, 
70%, 75%, 80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity 
with SEQ ID NO: 7 or a complement thereof, as determined using the BLAST
program and parameters described herein. In another embodiment, the
HDAC9a(ANLS) nucleic acid molecule consists of the nucleic acid sequence of SEQ
ID NO: 7.

An "HDRP(ANLS) nucleic acid molecule" is a nucleic acid molecule that 
encodes an HDRP(ANLS) polypeptide. In one embodiment, the HDRP(ANLS) 
nucleic acid molecule is selected from: a nucleic acid molecule that comprises the
nucleic acid sequence of SEQ ID NO: 9; a complement of an isolated nucleic acid 
comprising SEQ ID NO: 9; an isolated nucleic acid encoding a histone deacetylase
polypeptide of SEQ ID NO: 10; a complement of an isolated nucleic acid encoding a
histone deacetylase polypeptide of SEQ ID NO: 10; and an isolated nucleic acid 
molecule that has preferably, at least 55%, more preferably, 60%, 65%, 70%, 75%, 
80%, 85%, or 90%, and most preferably, 95% or 99% sequence identity with SEQ 
ID NO: 9 or a complement thereof, as determined using the BLAST program and
parameters described herein. In another embodiment, the HDRP(ANLS) nucleic
acid molecule consists of the nucleic acid sequence of SEQ ID NO: 9. 

The isolated nucleic acid molecules of the present invention can be RNA, for
example, mRNA, or DNA, such as cDNA and genomic DNA. DNA molecules can
be double-stranded or single-stranded; single-stranded RNA or DNA can be either
the coding, or sense, strand or the non-coding, or antisense, strand. The nucleic acid 
molecule can include all or a portion of the coding sequence of the gene and can 
further comprise additional non-coding sequences such as introns and non-coding 3' 
and 5' sequences (including regulatory sequences, for example). Additionally, the 
nucleic acid molecule can be fused to a marker sequence, for example, a sequence
that encodes a polypeptide to assist in isolation or purification of the polypeptide. 
Such sequences include, but are not limited to, those that encode a 
glutathione-S-transferase (GST) fusion protein and those that encode a 
hemagglutinin A (HA) polypeptide marker from influenza. 
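
As a simple illustration of the relationship between the coding (sense) strand and the non-coding (antisense) strand, and of what is meant by the complement of a DNA sequence, the following sketch derives one strand from the other. The function name is hypothetical, the input is assumed to contain only the unambiguous bases A, C, G, and T, and the result is written 5' to 3' (i.e., the reverse complement) by convention.

```python
# Watson-Crick pairing for unambiguous DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the complementary strand of a DNA sequence, written 5' to 3'."""
    return dna.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Hypothetical six-base sense strand, not taken from any SEQ ID NO.
    sense = "ATGGCC"
    antisense = reverse_complement(sense)
    print(antisense)
    # Applying the operation twice recovers the original strand.
    print(reverse_complement(antisense) == sense)
```
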

An "isolated," "substantially pure," or "substantially pure and isolated"
nucleic acid molecule, as used herein, is one that is separated from nucleic acids that
normally flank the gene or nucleotide sequence (as in genomic sequences) and/or has
been completely or partially purified from other transcribed sequences (e.g., as in an
RNA or cDNA library). For example, an isolated nucleic acid of the invention may
be substantially isolated with respect to the complex cellular milieu in which it
naturally occurs, or culture medium when produced by recombinant techniques, or
chemical precursors or other chemicals when chemically synthesized. In some
instances, the isolated material will form part of a composition (for example, a crude
extract containing other substances), buffer system, or reagent mix. In other
circumstances, the material may be purified to essential homogeneity, for example,
as determined by agarose gel electrophoresis or column chromatography such as
HPLC. Preferably, an isolated nucleic acid molecule comprises at least about 50, 80,
or 90% (on a molar basis) of all macromolecular species present. 

With regard to genomic DNA, the term "isolated" also can refer to nucleic 
acid molecules that are separated from the chromosome with which the genomic 
DNA is naturally associated. For example, the isolated nucleic acid molecule can
contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb, or 0.1 kb of nucleotides 
that flank the nucleic acid molecule in the genomic DNA of the cell from which the 
nucleic acid molecule is derived. 

The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
nucleic acid molecule can be fused to other coding or regulatory sequences and still
be considered isolated. Thus, recombinant DNA contained in a vector is included in 
the definition of "isolated" as used herein. Also, isolated nucleic acid molecules 
include recombinant DNA molecules in heterologous host cells, as well as partially 
or substantially purified DNA molecules in solution. "Isolated" nucleic acid
molecules also encompass in vivo and in vitro RNA transcripts of the DNA
molecules of the present invention. An isolated nucleic acid molecule or nucleotide
sequence can include a nucleic acid molecule or nucleotide sequence that is 
synthesized chemically or by recombinant means. Therefore, recombinant DNA 
contained in a vector are included in the definition of "isolated" as used herein. 

20 Isolated nucleotide molecules also include recombinant DNA molecules in 

heterologous organisms, as well as partially or substantially purified DNA molecules 
in solution. In vivo and in vitro RNA transcripts of the DNA molecules of the 
present invention are also encompassed by "isolated" nucleotide sequences. Such 
isolated nucleotide sequences are useful in the manufacture of the encoded
polypeptide, as probes for isolating homologous sequences (e.g., from other
mammalian species), for gene mapping (e.g., by in situ hybridization with 
chromosomes), or for detecting expression of the gene in tissue (e.g., human tissue), 
such as by Northern blot analysis. 

The present invention also pertains to variant HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules that are
not necessarily found in nature but that encode an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide. Thus, for
example, DNA molecules that comprise a sequence that is different from the 
naturally-occurring HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleotide sequence but which, due to the degeneracy of the genetic
code, encode an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide of the present invention are also the subject of this
invention. 
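
By way of example only, the degeneracy of the genetic code can be demonstrated by translating two different DNA sequences and confirming that they encode the same amino acid sequence. The sketch below builds the standard genetic code table from its conventional compact form (bases ordered T, C, A, G); the function names and the short example sequences are hypothetical, and the input is assumed to be an in-frame coding sequence containing only A, C, G, and T.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence ('*' marks a stop codon)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def encode_same_polypeptide(dna_a: str, dna_b: str) -> bool:
    """True when two nucleotide sequences translate to the same amino acid sequence."""
    return translate(dna_a) == translate(dna_b)


if __name__ == "__main__":
    # Hypothetical nine-base sequences, not taken from any SEQ ID NO.
    natural = "ATGCTGAAA"     # Met-Leu-Lys
    degenerate = "ATGTTAAAG"  # different codons for Leu and Lys
    print(translate(natural), translate(degenerate))
    print(encode_same_polypeptide(natural, degenerate))
```

Two sequences for which `encode_same_polypeptide` returns true differ at the nucleotide level yet encode the same polypeptide, which is the situation contemplated in the preceding paragraph.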

The invention also encompasses HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), and HDRP(ANLS) nucleotide sequences encoding portions
(fragments), or encoding variant polypeptides such as analogues or derivatives of an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide. Such variants can be naturally-occurring, such as in the case of allelic 
variation or single nucleotide polymorphisms, or non-naturally-occurring, such as 
those induced by various mutagens and mutagenic processes. Intended variations 
include, but are not limited to, addition, deletion, and substitution of one or more 
nucleotides that can result in conservative or non-conservative amino acid changes, 
including additions and deletions. Preferably, the HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleotide (and/or resultant
amino acid) changes are silent or conserved; that is, they do not alter the 
characteristics or activity of the HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) polypeptide. In one preferred embodiment, the 
nucleotide sequences are fragments that comprise one or more polymorphic 
microsatellite markers. 

Other alterations of the HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecules of the invention can
include, for example, labeling, methylation, internucleotide modifications such as
uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates,
and carbamates), charged linkages (e.g., phosphorothioates or phosphorodithioates), 
pendent moieties (e.g., polypeptides), intercalators (e.g., acridine or psoralen), 
chelators, alkylators, and modified linkages (e.g., alpha anomeric nucleic acids). 
Also included are synthetic molecules that mimic nucleic acid molecules in the 
ability to bind to a designated sequences via hydrogen bonding and other chemical 
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interactions. Such molecules include, for example, those in which peptide linkages 
substitute for phosphate linkages in the backbone of the molecule. 

The invention also pertains to HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules that hybridize under
high stringency hybridization conditions, such as for selective hybridization, to a
nucleotide sequence described herein (e.g., nucleic acid molecules that specifically
hybridize to a nucleotide sequence encoding polypeptides described herein, and,
optionally, have an activity of the polypeptide). In one embodiment, the invention
includes variants described herein that hybridize under high stringency hybridization
conditions (e.g., for selective hybridization) to a nucleotide sequence comprising a
nucleotide sequence selected from SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5,
SEQ ID NO: 7, SEQ ID NO: 9 and the complement of SEQ ID NO: 1, SEQ ID NO:
3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9. In another embodiment, the
invention includes variants described herein that hybridize under high stringency
hybridization conditions (e.g., for selective hybridization) to a nucleotide sequence
encoding an amino acid sequence of SEQ ID NO: 2 (HDAC9), SEQ ID NO: 4
(HDAC9a), SEQ ID NO: 6 (HDAC9(ANLS)), SEQ ID NO: 8 (HDAC9a(ANLS)), or
SEQ ID NO: 10 (HDRP(ANLS)). In a preferred embodiment, the variant that
hybridizes under high stringency hybridization conditions encodes a polypeptide
that has a biological activity of an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide (e.g., histone deacetylase activity
or transcription repression activity).

Such nucleic acid molecules can be detected and/or isolated by specific
hybridization (e.g., under high stringency conditions). "Specific hybridization," as
used herein, refers to the ability of a first nucleic acid to hybridize to a second
nucleic acid in a manner such that the first nucleic acid does not hybridize to any
nucleic acid other than to the second nucleic acid (e.g., when the first nucleic acid
has a higher similarity to the second nucleic acid than to any other nucleic acid in a
sample wherein the hybridization is to be performed). "Stringency conditions" for
hybridization is a term of art that refers to the incubation and wash conditions, e.g.,
conditions of temperature and buffer concentration, that permit hybridization of a
particular nucleic acid to a second nucleic acid; the first nucleic acid may be
perfectly (i.e., 100%) complementary to the second, or the first and second may
share some degree of complementarity that is less than perfect (e.g., 70%, 75%,
85%, 95%). For example, certain high stringency conditions can be used that
distinguish perfectly complementary nucleic acids from those of less
complementarity. "High stringency conditions," "moderate stringency conditions,"
and "low stringency conditions" for nucleic acid hybridizations are explained on
pages 2.10.1-2.10.16 and pages 6.3.1-6.3.6 in Current Protocols in Molecular
Biology (see Ausubel et al., supra, the entire teachings of which are incorporated by
reference herein). The exact conditions that determine the stringency of
hybridization depend not only on ionic strength (e.g., 0.2XSSC or 0.1XSSC),
temperature (e.g., room temperature, 42°C or 68°C), and the concentration of
destabilizing agents such as formamide or denaturing agents such as SDS, but also
on factors such as the length of the nucleic acid sequence, base composition, percent
mismatch between hybridizing sequences, and the frequency of occurrence of
subsets of that sequence within other non-identical sequences. Thus, equivalent
conditions can be determined by varying one or more of these parameters while
maintaining a similar degree of identity or similarity between the two nucleic acid
molecules. Typically, conditions are used such that sequences at least about 60%, at
least about 70%, at least about 80%, at least about 90% or at least about 95% or
more identical to each other remain hybridized to one another. By varying
hybridization conditions from a level of stringency at which no hybridization occurs
to a level at which hybridization is first observed, conditions that will allow a given
sequence to hybridize (e.g., selectively) with the most similar sequences in the
sample can be determined.

Exemplary conditions are described in Krause and Aaronson, Methods in
Enzymology, 200:546-556 (1991). See also Ausubel et al., supra, which describes
the determination of washing conditions for moderate or low stringency conditions.
Washing is the step in which conditions are usually set so as to determine a
minimum level of complementarity of the hybrids. Generally, starting from the
lowest temperature at which only homologous hybridization occurs, each °C by
which the final wash temperature is reduced (holding SSC concentration constant)
allows an increase by 1% in the maximum extent of mismatching among the
sequences that hybridize. Generally, doubling the concentration of SSC results in an
increase in Tm of 17°C. Using these guidelines, the washing temperature can be
determined empirically for high, moderate, or low stringency, depending on the level
of mismatch sought.
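
The two rules of thumb just stated can be combined into a rough estimate of a
final wash temperature, as in the Python sketch below. This is only an
illustrative calculation and not a validated protocol: the function name, its
parameters, the 68°C starting temperature used in the example, and the use of
log2 to interpolate between doublings of SSC are all assumptions introduced
here. In practice the washing temperature is determined empirically, as noted
above.

```python
import math


def wash_temperature_c(t_homologous_c, max_mismatch_pct, ssc_fold_change=1.0):
    """Estimate a final wash temperature in degrees C from two rules of thumb.

    t_homologous_c   -- lowest temperature at which only homologous
                        hybridization occurs at the reference SSC concentration
    max_mismatch_pct -- maximum percent mismatch to be tolerated among hybrids
    ssc_fold_change  -- wash SSC concentration relative to the reference
                        (2.0 means the SSC concentration is doubled)
    """
    if max_mismatch_pct < 0:
        raise ValueError("max_mismatch_pct must be non-negative")
    if ssc_fold_change <= 0:
        raise ValueError("ssc_fold_change must be positive")
    # Rule 1: doubling the SSC concentration raises Tm by about 17 degrees C.
    # Interpolating with log2 between doublings is an assumption of this sketch.
    tm_shift = 17.0 * math.log2(ssc_fold_change)
    # Rule 2: each 1 degree C reduction in wash temperature allows about
    # 1% more mismatch among the sequences that hybridize.
    return t_homologous_c + tm_shift - max_mismatch_pct


# Hypothetical example: tolerate up to 10% mismatch, starting from an assumed
# 68 degrees C homologous-only temperature at unchanged SSC concentration.
example_wash_temp = wash_temperature_c(68.0, 10.0)
```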

For example, a low stringency wash can comprise washing in a solution
containing 0.2XSSC/0.1% SDS for 10 minutes at room temperature; a moderate
stringency wash can comprise washing in a prewarmed (42°C) solution
containing 0.2XSSC/0.1% SDS for 15 minutes at 42°C; and a high stringency wash
can comprise washing in a prewarmed (68°C) solution containing 0.1XSSC/0.1% SDS
for 15 minutes at 68°C. Furthermore, washes can be performed repeatedly or
sequentially to obtain a desired result as known in the art. Equivalent conditions can
be determined by varying one or more of the parameters given as an example, as
known in the art, while maintaining a similar degree of identity or similarity between
the target nucleic acid molecule and the primer or probe used.
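
For convenience, the three exemplary washes above can be recorded as a small
lookup table, for instance in a laboratory script. The Python sketch below
simply restates the values given in the preceding paragraph. The class, field,
and function names are hypothetical, and `None` is used here as an assumed
convention for "room temperature".

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class WashCondition:
    ssc_x: float                    # SSC concentration, as a multiple of 1X SSC
    sds_percent: float              # SDS concentration in percent
    minutes: int                    # duration of the wash
    temperature_c: Optional[float]  # None means room temperature


# Values restated from the exemplary low, moderate, and high stringency washes.
EXEMPLARY_WASHES = {
    "low": WashCondition(0.2, 0.1, 10, None),
    "moderate": WashCondition(0.2, 0.1, 15, 42.0),
    "high": WashCondition(0.1, 0.1, 15, 68.0),
}


def describe_wash(stringency):
    """Return a one-line description of the exemplary wash for a stringency level."""
    wash = EXEMPLARY_WASHES[stringency]
    if wash.temperature_c is None:
        temperature = "room temperature"
    else:
        temperature = f"{wash.temperature_c:g} degrees C"
    return (f"{wash.ssc_x:g}X SSC / {wash.sds_percent:g}% SDS "
            f"for {wash.minutes} minutes at {temperature}")
```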

To determine the percent homology or identity of two nucleic acid
sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps
can be introduced in the sequence of one polypeptide or nucleic acid molecule for
optimal alignment with the other polypeptide or nucleic acid molecule). The amino
acid residues or nucleotides at corresponding amino acid positions or nucleotide
positions are then compared, as described above.
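
A minimal Python sketch of such a position-by-position comparison is given
below. It assumes that the two sequences have already been aligned (gaps
written as "-") and that percent identity is computed as the number of
identical positions divided by the total number of aligned positions, times
100. That denominator convention, the function name, and the toy sequences are
assumptions made for illustration.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned sequences of equal length.

    Positions at which either sequence has a gap are counted as non-identical,
    and the denominator is the full alignment length (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


# Hypothetical toy alignment: one gap and one mismatch in ten columns.
identity = percent_identity("ATGCT-GAAA", "ATGCTAGATA")
```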

The present invention also provides isolated HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) nucleic acid molecules that
contain a fragment or portion that hybridizes under highly stringent conditions to a
nucleotide sequence comprising a nucleotide sequence selected from SEQ ID NO: 1,
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, and the complement
of any of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID
NO: 9 and also provides isolated nucleic acid molecules that contain a fragment or
portion that hybridizes under highly stringent conditions to a nucleotide sequence
encoding an amino acid sequence selected from SEQ ID NO: 2, SEQ ID NO: 4, SEQ
ID NO: 6, SEQ ID NO: 8, and SEQ ID NO: 10. The nucleic acid fragments of the
invention are at least about 15, preferably, at least about 18, 20, 23, or 25
nucleotides, and can be 30, 40, 50, 100, 200 or more nucleotides in length. Longer
fragments, for example, 30 or more nucleotides in length, that encode antigenic
polypeptides described herein are particularly useful, such as for the generation of
antibodies as described above.

In a related aspect, the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS),
and HDRP(ANLS) nucleic acid fragments of the invention are used as probes or
primers in assays such as those described herein. "Probes" or "primers" are
oligonucleotides that hybridize in a base-specific manner to a complementary strand
of nucleic acid molecules. Such probes and primers include polypeptide nucleic
acids, as described in Nielsen et al., Science, 254, 1497-1500 (1991). As also used
herein, the term "primer" in particular refers to a single-stranded oligonucleotide that
acts as a point of initiation of template-directed DNA synthesis using well-known
methods (e.g., PCR, LCR) including, but not limited to those described herein.

Typically, a probe or primer comprises a region of nucleotide sequence that
hybridizes to at least about 15, typically about 20-25, and more typically about 40,
50 or 75, consecutive nucleotides of a nucleic acid molecule comprising a
contiguous nucleotide sequence selected from: SEQ ID NO: 1, SEQ ID NO: 3, SEQ
ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, the complement of any of SEQ ID NO: 1,
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, and a sequence
encoding an amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6,
SEQ ID NO: 8, or SEQ ID NO: 10.

In preferred embodiments, a probe or primer comprises 100 or fewer
nucleotides, preferably, from 6 to 50 nucleotides, and more preferably, from 12 to 30
nucleotides. In other embodiments, the probe or primer is at least 70% identical to
the contiguous nucleotide sequence or to the complement of the contiguous
nucleotide sequence, preferably, at least 80% identical, more preferably, at least 90%
identical, even more preferably, at least 95% identical, or even capable of selectively
hybridizing to the contiguous nucleotide sequence or to the complement of the
contiguous nucleotide sequence. Often, the probe or primer further comprises a
label, e.g., radioisotope, fluorescent compound, enzyme, or enzyme co-factor.
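
The length and identity ranges above can be checked mechanically for a
candidate probe or primer. The Python sketch below is one hedged way to do so:
it scores a candidate against every same-length, ungapped window of a target
sequence and of the target's reverse complement. The function names, the
ungapped-window scoring, and the toy target sequence are assumptions made for
illustration; the sketch does not predict actual hybridization behavior.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def best_ungapped_identity(primer, target):
    """Highest percent identity of primer to any same-length window of target."""
    primer, target = primer.upper(), target.upper()
    n = len(primer)
    if n == 0 or n > len(target):
        return 0.0
    best = 0
    for start in range(len(target) - n + 1):
        window = target[start:start + n]
        best = max(best, sum(p == t for p, t in zip(primer, window)))
    return 100.0 * best / n


def evaluate_primer(primer, target, min_identity=70.0):
    """Report which of the stated length and identity ranges a candidate meets."""
    n = len(primer)
    identity = max(
        best_ungapped_identity(primer, target),
        best_ungapped_identity(primer, reverse_complement(target)),
    )
    return {
        "length": n,
        "at_most_100_nt": n <= 100,
        "from_6_to_50_nt": 6 <= n <= 50,
        "from_12_to_30_nt": 12 <= n <= 30,
        "identity_pct": identity,
        "meets_min_identity": identity >= min_identity,
    }


# Hypothetical toy target (not from any SEQ ID NO) and an 18-nucleotide
# candidate that is complementary to positions 3-20 of the target.
toy_target = "ATGCTGAAAGGCTTAGCCGATCGTACGATCG"
toy_primer = reverse_complement(toy_target[3:21])
report = evaluate_primer(toy_primer, toy_target)
```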

The nucleic acid molecules of the invention such as those described above
can be identified and isolated using standard molecular biology techniques and the
sequence information provided in SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5,
SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6,
SEQ ID NO: 8, and/or SEQ ID NO: 10. For example, nucleic acid molecules can
be amplified and isolated by the polymerase chain reaction using synthetic
oligonucleotide primers designed based on one or more of the nucleic acid
sequences provided above and/or the complement of those sequences. Or such
nucleic acid molecules may be designed based on nucleotide sequences encoding
one or more of the amino acid sequences provided in SEQ ID NO: 2, SEQ ID NO: 4,
SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10. See generally PCR Technology:
Principles and Applications for DNA Amplification (ed. H. A. Erlich, Freeman Press,
NY, NY, 1992); PCR Protocols: A Guide to Methods and Applications (eds. Innis
et al., Academic Press, San Diego, CA, 1990); Mattila et al., Nucleic Acids Res.,
19:4967 (1991); Eckert et al., PCR Methods and Applications, 1:17 (1991); PCR
(eds. McPherson et al., IRL Press, Oxford); and U.S. Patent No. 4,683,202. The
nucleic acid molecules can be amplified using cDNA, mRNA, or genomic DNA as a
template, cloned into an appropriate vector and characterized by DNA sequence
analysis.

Other suitable amplification methods include the ligase chain reaction (LCR)
(see Wu and Wallace, Genomics, 4:560 (1989); Landegren et al., Science, 241:1077
(1988)), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA,
86:1173 (1989)), self-sustained sequence replication (see Guatelli et al., Proc.
Natl. Acad. Sci. USA, 87:1874 (1990)), and nucleic acid based sequence
amplification (NASBA). The latter two amplification methods involve isothermal
reactions based on isothermal transcription, that produce both single stranded RNA
(ssRNA) and double stranded DNA (dsDNA) as the amplification products in a ratio
of about 30 or 100 to 1, respectively.

The amplified DNA can be radiolabeled and used as a probe for screening a
cDNA library derived from human cells, mRNA in zap express, ZIPLOX, or other
suitable vector. Corresponding clones can be isolated, DNA can be obtained
following in vivo excision, and the cloned insert can be sequenced in either or both
orientations by art-recognized methods to identify the correct reading frame
encoding a polypeptide of the appropriate molecular weight. For example, the direct
analysis of the nucleotide sequence of nucleic acid molecules of the present
invention can be accomplished using well-known methods that are commercially
available. See, for example, Sambrook et al., Molecular Cloning, A Laboratory
Manual (2nd Ed., CSHP, New York (1989)); Zyskind et al., Recombinant DNA
Laboratory Manual, (Acad. Press, (1988)). Using these or similar methods, the
polypeptide and the DNA encoding the polypeptide can be isolated, sequenced, and
further characterized.

Antisense nucleic acid molecules of the invention can be designed using the
nucleotide sequences of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID
NO: 7, SEQ ID NO: 9 and/or the complement of any of SEQ ID NO: 1, SEQ ID NO:
3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9 and/or a portion of those
sequences, and/or the complement of those portions or sequences, and/or a sequence
encoding the amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6,
SEQ ID NO: 8, SEQ ID NO: 10, or encoding a portion of SEQ ID NO: 2, SEQ ID
NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10. Such antisense nucleic
acid molecules can be constructed using chemical synthesis and enzymatic ligation
reactions using procedures known in the art. For example, an antisense nucleic acid
molecule (e.g., an antisense oligonucleotide) can be chemically synthesized using
naturally occurring nucleotides or variously modified nucleotides designed to
increase the biological stability of the molecules or to increase the physical stability
of the duplex formed between the antisense and sense nucleic acids, e.g.,
phosphorothioate derivatives and acridine substituted nucleotides can be used.
Alternatively, the antisense nucleic acid molecule can be produced biologically using
an expression vector into which a nucleic acid molecule has been subcloned in an
antisense orientation (i.e., RNA transcribed from the inserted nucleic acid molecule
will be of an antisense orientation to a target nucleic acid of interest).
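
At the sequence level, an antisense oligonucleotide is the reverse complement
of the targeted portion of the sense strand. The Python sketch below derives
the DNA sequence of such an oligonucleotide for a chosen region of a sense
RNA. The function name, its parameters, and the toy sequence are hypothetical.
Chemical modifications such as phosphorothioate linkages, and any selection of
an effective target region, are outside the scope of this sketch.

```python
# Maps each RNA base of the sense strand to the complementary DNA base.
_RNA_TO_COMPLEMENTARY_DNA = str.maketrans("ACGU", "TGCA")


def antisense_oligo(sense_rna, start, length):
    """Return a DNA antisense oligonucleotide, written 5' to 3'.

    The oligonucleotide is complementary to sense_rna[start:start + length].
    The input may be written with T or U; it is normalized to RNA first.
    """
    if start < 0 or length <= 0:
        raise ValueError("start must be >= 0 and length must be > 0")
    sense_rna = sense_rna.upper().replace("T", "U")
    region = sense_rna[start:start + length]
    if len(region) != length:
        raise ValueError("requested region extends past the end of the sequence")
    # Complement each base, then reverse so the result reads 5' to 3'.
    return region.translate(_RNA_TO_COMPLEMENTARY_DNA)[::-1]


# Hypothetical toy sense sequence (not from any SEQ ID NO).
toy_sense = "AUGCUGAAAGGC"
toy_antisense = antisense_oligo(toy_sense, 0, 12)
```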

In general, the isolated HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS),
and HDRP(ANLS) nucleic acid sequences of the invention can be used as molecular
weight markers on Southern blots, and as chromosome markers that are labeled to
map related gene positions. The nucleic acid sequences can also be used to compare
with endogenous DNA sequences in patients to identify genetic disorders (e.g., a
predisposition for or susceptibility to a cell proliferation disease, an apoptotic
disease, or a cell differentiation disease), and as probes, such as to hybridize and
discover related DNA sequences or to subtract out known sequences from a sample.
The nucleic acid molecules of the present invention can also be used as therapeutic
agents.

By a "cell proliferation disease" is meant a disease that is caused by or results
in undesirably high levels of cell division, undesirably low levels of apoptosis, or
both. For example, cancers such as lymphoma, leukemia, melanoma, ovarian
cancer, breast cancer, pancreatic cancer, prostate cancer, colon cancer, and lung
cancer are all examples of cell proliferation diseases. Myeloproliferative disorders,
including polycythemia vera, essential thrombocythemia, agnogenic myeloid
metaplasia, and chronic myelogenous leukemia are also cell proliferation diseases.

By a "cell differentiation disease" is meant a disease that is caused by or
results in undesirably low levels of cell differentiation, or by undesirably high levels
of cell differentiation. For example, cancers such as lymphoma, leukemia,
melanoma, ovarian cancer, breast cancer, pancreatic cancer, prostate cancer, colon
cancer, and lung cancer are all examples of cell differentiation diseases.
Myeloproliferative disorders, including polycythemia vera, essential
thrombocythemia, agnogenic myeloid metaplasia, and chronic myelogenous
leukemia are also cell differentiation diseases.

By an "apoptotic disease" is meant a condition in which the apoptotic
response is abnormal. This may pertain to a cell or a population of cells that does
not undergo cell death under appropriate conditions. For example, normally a cell
will die upon exposure to apoptotic-triggering agents, such as chemotherapeutic
agents, or ionizing radiation. When, however, a subject has an apoptotic disease, for
example, cancer, the cell or a population of cells may not undergo cell death in
response to contact with apoptotic-triggering agents. In addition, a subject may have
an apoptotic disease when the occurrence of cell death is too low, for example, when
the number of proliferating cells exceeds the number of cells undergoing cell death,
as occurs in cancer when such cells do not properly differentiate.

An apoptotic disease may also be a condition characterized by the occurrence
of undesirably high levels of apoptosis. For example, certain neurodegenerative
diseases, including but not limited to Alzheimer's disease, Parkinson's disease,
amyotrophic lateral sclerosis, multiple sclerosis, restenosis, stroke, and ischemic
brain injury are apoptotic diseases in which neuronal cells undergo undesired cell
death.



Other diseases for which the polypeptides and nucleic acid molecules of the 
present invention may be useful for diagnosing and/or treating include, but are not 
limited to Huntington's disease. 

The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and
HDRP(ANLS) nucleic acid molecules of the present invention can further be used to
derive primers for genetic fingerprinting, to raise anti-polypeptide antibodies using
DNA immunization techniques, and as an antigen to raise anti-DNA antibodies or
elicit immune responses. Portions or fragments of the nucleotide sequences
identified herein (and the corresponding complete gene sequences) can be used in
numerous ways as polynucleotide reagents. For example, these sequences can be
used to: (i) map their respective genes on a chromosome and, thus, locate gene
regions associated with genetic disease; (ii) identify an individual from a minute
biological sample (tissue typing); and (iii) aid in forensic identification of a
biological sample.

In addition, the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and
HDRP(ANLS) nucleotide sequences of the invention can be used to identify and
express recombinant polypeptides for analysis, characterization, or therapeutic use,
or as markers for tissues in which the corresponding polypeptide is expressed, either
constitutively, during tissue differentiation, or in diseased states. The nucleic acid
sequences can additionally be used as reagents in the screening and/or diagnostic
assays described herein, and can also be included as components of kits (e.g.,
reagent kits) for use in the screening and/or diagnostic assays described herein.

Standard techniques, such as the polymerase chain reaction (PCR) and DNA
hybridization, may be used to clone HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) homologs in other species, for example,
mammalian homologs. HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) homologs may be readily identified using low-stringency DNA
hybridization or low-stringency PCR with human HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) probes or primers. Degenerate
primers encoding human HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptides may be used to clone HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) homologs by RT-PCR.
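
When designing degenerate primers from a polypeptide sequence, it is common to
favor peptide stretches that can be encoded by few alternative nucleotide
sequences. The Python sketch below computes the degeneracy (number of possible
coding sequences under the standard genetic code) of a peptide and picks the
least degenerate window of a given length. The function names and the toy
peptide are hypothetical and are not taken from any SEQ ID NO; the sketch
ignores codon usage bias and other practical primer design considerations.

```python
from collections import Counter
from math import prod

# Standard genetic code written as one amino acid per codon (64 entries, with
# "*" for stop codons). Counting letters gives the number of codons per residue.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS_PER_RESIDUE = Counter(AMINO_ACIDS)
del CODONS_PER_RESIDUE["*"]  # stop codons are not residues


def primer_degeneracy(peptide):
    """Number of distinct nucleotide sequences that can encode the peptide."""
    peptide = peptide.upper()
    unknown = [aa for aa in peptide if aa not in CODONS_PER_RESIDUE]
    if not peptide or unknown:
        raise ValueError("peptide must be non-empty, standard one-letter codes")
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide)


def least_degenerate_window(peptide, n_residues):
    """Return (start_index, window, degeneracy) of the least degenerate window."""
    if not 0 < n_residues <= len(peptide):
        raise ValueError("n_residues must be between 1 and the peptide length")
    windows = (
        (start, peptide[start:start + n_residues])
        for start in range(len(peptide) - n_residues + 1)
    )
    scored = ((start, w, primer_degeneracy(w)) for start, w in windows)
    return min(scored, key=lambda item: item[2])


# Hypothetical toy peptide used only to exercise the functions.
toy_peptide = "LSRMWHQKL"
best_window = least_degenerate_window(toy_peptide, 4)
```

For the toy peptide, the window rich in Met and Trp (one codon each) is
selected, reflecting the usual preference for low-degeneracy residues.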

Alternatively, additional HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) homologs can be identified by utilizing
consensus sequence information for HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptides to search for similar polypeptides
in other species. For example, polypeptide databases for other species can be
searched for proteins with the HDAC domains described herein. Candidate
polypeptides containing such a motif can then be tested for their HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) biological activities, using
methods described herein.
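
A motif-based database search of this kind can be sketched, in its simplest
form, as a pattern scan over a collection of polypeptide sequences. In the
Python sketch below, the motif pattern, the protein names, and the sequences
are all placeholders invented for illustration. The pattern is not the actual
HDAC domain consensus, which would have to be derived from the sequences
described herein, and real searches would normally use dedicated sequence
similarity tools rather than a plain regular expression.

```python
import re


def find_motif_candidates(proteins, motif_regex):
    """Return {name: [match start positions]} for proteins containing the motif.

    proteins    -- mapping of protein name to one-letter amino acid sequence
    motif_regex -- regular expression describing the motif of interest
    """
    pattern = re.compile(motif_regex)
    hits = {}
    for name, sequence in proteins.items():
        positions = [m.start() for m in pattern.finditer(sequence.upper())]
        if positions:
            hits[name] = positions
    return hits


# Placeholder database and motif (hypothetical; NOT the real HDAC consensus).
toy_database = {
    "protein_A": "MKTLLHAAGDWPQ",
    "protein_B": "MSSPQRTTLLK",
    "protein_C": "MHCCGEAAHTTGD",
}
PLACEHOLDER_MOTIF = r"H..G[DE]"

candidates = find_motif_candidates(toy_database, PLACEHOLDER_MOTIF)
```

Candidates returned by such a scan would still need to be tested for
biological activity, as stated above.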

EXPRESSION OF THE NUCLEIC ACID MOLECULES OF THE INVENTION 

Another aspect of the invention pertains to nucleic acid constructs containing
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic
acid molecule, for example, one selected from the group consisting of SEQ ID NO:
1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, SEQ ID NO: 9, and the
complement of any of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO:
7, or SEQ ID NO: 9 (or portions thereof). Yet another aspect of the invention
pertains to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS)
nucleic acid constructs containing a nucleic acid molecule encoding the amino acid
sequence of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ
ID NO: 10. The constructs comprise a vector (e.g., an expression vector) into which
a sequence of the invention has been inserted in a sense or antisense orientation.

As used herein, the term "vector" or "construct" refers to a nucleic acid
molecule capable of transporting another nucleic acid to which it has been linked.
One type of vector is a "plasmid," which refers to a circular double stranded DNA
loop into which additional DNA segments can be ligated. Another type of vector is
a viral vector, wherein additional DNA segments can be ligated into the viral
genome. Certain vectors are capable of autonomous replication in a host cell into
which they are introduced (e.g., bacterial vectors having a bacterial origin of
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal
mammalian vectors) are integrated into the genome of a host cell upon introduction
into the host cell, and thereby are replicated along with the host genome. Moreover,
certain vectors, expression vectors, are capable of directing the expression of genes
to which they are operably linked. In general, expression vectors of utility in
recombinant DNA techniques are often in the form of plasmids. However, the
invention is intended to include such other forms of expression vectors, such as viral
vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated
viruses) that serve equivalent functions.

Preferred recombinant expression vectors of the invention comprise a nucleic
acid molecule of the invention in a form suitable for expression of the nucleic acid
molecule in a host cell. This means that the recombinant expression vectors include
one or more regulatory sequences, selected on the basis of the host cells to be used
for expression, that are operably linked to the nucleic acid sequence to be
expressed. Within a recombinant expression vector, "operably linked" is intended to
mean that the nucleotide sequence of interest is linked to the regulatory sequence(s)
in a manner that allows for expression of the nucleotide sequence (e.g., in an in vitro
transcription/translation system or in a host cell when the vector is introduced into
the host cell). The term "regulatory sequence" is intended to include promoters,
enhancers and other expression control elements (e.g., polyadenylation signals).
Such regulatory sequences are described, for example, in Goeddel, Gene Expression
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990).
Regulatory sequences include those that direct constitutive expression of a
nucleotide sequence in many types of host cell and those that direct expression of the
nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory
sequences).

It will be appreciated by those skilled in the art that the design of the
expression vector can depend on such factors as the choice of the host cell to be
transformed and the level of expression of polypeptide desired. The expression
vectors of the invention can be introduced into host cells to thereby produce
polypeptides, including fusion polypeptides, encoded by nucleic acid molecules as
described herein.

The recombinant expression vectors of the invention can be designed for
expression of a polypeptide of the invention in prokaryotic or eukaryotic cells, e.g.,
bacterial cells, such as E. coli, insect cells (using baculovirus expression vectors),
yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel,
supra. Alternatively, the recombinant expression vector can be transcribed and
translated in vitro, for example, using T7 promoter regulatory sequences and T7
polymerase.

Another aspect of the invention pertains to host cells into which a
recombinant expression vector of the invention has been introduced. The terms
"host cell" and "recombinant host cell" are used interchangeably herein. It is
understood that such terms refer not only to the particular subject cell but also to the
progeny or potential progeny of such a cell. Because certain modifications may
occur in succeeding generations due to either mutation or environmental influences,
such progeny may not, in fact, be identical to the parent cell, but are still included
within the scope of the term as used herein.

A host cell can be any prokaryotic or eukaryotic cell. For example, a nucleic
acid molecule of the invention can be expressed in bacterial cells (e.g., E. coli),
insect cells, yeast, or mammalian cells (such as Chinese hamster ovary cells (CHO)
or COS cells, human 293T cells, HeLa cells, NIH 3T3 cells, and mouse
erythroleukemia (MEL) cells). Other suitable host cells are known to those skilled
in the art.

Vector DNA can be introduced into prokaryotic or eukaryotic cells via
conventional transformation or transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer to a variety of
art-recognized techniques for introducing a foreign nucleic acid molecule (e.g.,
DNA) into a host cell, including calcium phosphate or calcium chloride
co-precipitation, DEAE-dextran-mediated transfection, lipofection, or
electroporation. Suitable methods for transforming or transfecting host cells can be
found in Sambrook, et al. (supra), and other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon
the expression vector and transfection technique used, only a small fraction of cells
may integrate the foreign DNA into their genome. In order to identify and select
these integrants, a gene that encodes a selectable marker (e.g., for resistance to
antibiotics) is generally introduced into the host cells along with the gene of interest.
Preferred selectable markers include those that confer resistance to drugs, such as
G418, hygromycin, or methotrexate. Nucleic acid molecules encoding a selectable
marker can be introduced into a host cell on the same vector as the nucleic acid
molecule of the invention or can be introduced on a separate vector. Cells stably
transfected with the introduced nucleic acid molecule can be identified by drug
selection (e.g., cells that have incorporated the selectable marker gene will survive,
while the other cells die).

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in
culture, can be used to produce (i.e., express) a polypeptide of the invention.
Accordingly, the invention further provides methods for producing a polypeptide
using the host cells of the invention. In one embodiment, the method comprises
culturing the host cell of the invention (into which a recombinant expression vector
encoding a polypeptide of the invention has been introduced) in a suitable medium
such that the polypeptide is produced. In another embodiment, the method further
comprises isolating the polypeptide from the medium or the host cell.

The host cells of the invention can also be used to produce non-human
transgenic animals. For example, in one embodiment, a host cell of the invention is
a fertilized oocyte or an embryonic stem cell into which an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule of the
invention has been introduced. Such host cells can then be used to create
non-human transgenic animals in which exogenous nucleotide sequences have been
introduced into the genome or homologous recombinant animals in which
endogenous nucleotide sequences have been altered. Such animals are useful for
studying the function and/or activity of the nucleotide sequence and polypeptide
encoded by the sequence and for identifying and/or evaluating modulators of their
activity.

As used herein, a "transgenic animal" is a non-human animal, preferably, a
mammal, more preferably, a rodent such as a rat or mouse, in which one or more of
the cells of the animal includes a transgene. Other examples of transgenic animals
include non-human primates, sheep, dogs, cows, goats, chickens, and amphibians. A
transgene is exogenous DNA that is integrated into the genome of a cell from which
a transgenic animal develops and that remains in the genome of the mature animal,
thereby directing the expression of an encoded gene product in one or more cell
types or tissues of the transgenic animal. As used herein, a "homologous
recombinant animal" is a non-human animal, preferably, a mammal, more
preferably, a mouse, in which an endogenous gene has been altered by homologous
recombination between the endogenous gene and an exogenous DNA molecule
introduced into a cell of the animal, e.g., an embryonic cell of the animal, prior to
development of the animal.

Methods for generating transgenic animals via embryo manipulation and
microinjection, particularly animals such as mice, have become conventional in the
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009,
U.S. Patent No. 4,873,191, and in Hogan, Manipulating the Mouse Embryo (Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., (1986)). Methods for
constructing homologous recombination vectors and homologous recombinant
animals are described further in Bradley, Current Opinion in Bio/Technology,
2:823-829 (1991) and in PCT Publication Nos. WO 90/11354, WO 91/01140, WO
92/0968, and WO 93/04169. Clones of the non-human transgenic animals described
herein can also be produced according to the methods described in Wilmut et al.,
Nature, 385:810-813 (1997) and PCT Publication Nos. WO 97/07668 and WO
97/07669.

ANTIBODIES OF THE INVENTION 

Polyclonal and/or monoclonal antibodies that selectively bind one form of an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide but not another form of the polypeptide are also provided. Antibodies
are also provided that bind a portion of either the variant or reference HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide that
contains the polymorphic site or sites.

In another aspect, the invention provides antibodies to each of the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and HDRP(ANLS) polypeptides and
polypeptide fragments of the invention, having an amino acid sequence encoded
by SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10,
or a portion thereof, or having an amino acid sequence encoded by a nucleic acid
molecule comprising all or a portion of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO:
5, SEQ ID NO: 7, or SEQ ID NO: 9 (e.g., SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID
NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, or another variant, or portion thereof).

The term "purified antibody" as used herein refers to immunoglobulin
molecules and immunologically active portions of immunoglobulin molecules, i.e.,
molecules that contain an antigen binding site that selectively binds an antigen. A
molecule that selectively binds to a polypeptide of the invention is a molecule that
binds to that polypeptide or a fragment thereof, but does not substantially bind other
molecules in a sample, e.g., a biological sample that naturally contains the
polypeptide. Preferably, the antibody is at least 60%, by weight, free from proteins
and naturally occurring organic molecules with which it is naturally associated. More
preferably, the antibody preparation is at least 75% or 90%, and most preferably,
99%, by weight, antibody. Examples of immunologically active portions of
immunoglobulin molecules include F(ab) and F(ab')2 fragments that can be
generated by treating the antibody with an enzyme such as pepsin.

The invention provides polyclonal and monoclonal antibodies that selectively
bind to an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide of the invention. The term "monoclonal antibody" or "monoclonal
antibody composition," as used herein, refers to a population of antibody molecules
that contain only one species of an antigen binding site capable of immunoreacting
with a particular epitope of a polypeptide of the invention. A monoclonal antibody
composition thus typically displays a single binding affinity for a particular
polypeptide of the invention with which it immunoreacts.

Polyclonal antibodies can be prepared as described above by immunizing a
suitable subject with a desired immunogen, e.g., an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide of the invention or
fragment thereof. The antibody titer in the immunized subject can be monitored
over time by standard techniques, such as with an enzyme linked immunosorbent
assay (ELISA) using immobilized polypeptide. If desired, the antibody molecules
directed against the polypeptide can be isolated from the mammal (e.g., from the
blood) and further purified by well-known techniques, such as protein A
chromatography to obtain the IgG fraction.

At an appropriate time after immunization, e.g., when the antibody titers are
highest, antibody-producing cells can be obtained from the subject and used to
prepare monoclonal antibodies by standard techniques, such as the hybridoma
technique originally described by Kohler and Milstein, Nature, 256:495-497 (1975),
the human B cell hybridoma technique (Kozbor et al., Immunol. Today, 4:72
(1983)), the EBV-hybridoma technique (Cole et al., Monoclonal Antibodies and
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 (1985)) or trioma techniques. The
technology for producing hybridomas is well known (see generally Current Protocols
in Immunology, Coligan et al., (eds.) John Wiley & Sons, Inc., New York, NY
(1994)). Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes
(typically splenocytes) from a mammal immunized with an immunogen as described
above, and the culture supernatants of the resulting hybridoma cells are screened to
identify a hybridoma producing a monoclonal antibody that binds a polypeptide of
the invention.

Any of the many well known protocols used for fusing lymphocytes and
immortalized cell lines can be applied for the purpose of generating a monoclonal
antibody to a polypeptide of the invention (see, e.g., Current Protocols in
Immunology, supra; Galfre et al., (1977) Nature, 266:550-52; R.H. Kenneth, in
Monoclonal Antibodies: A New Dimension In Biological Analyses, Plenum
Publishing Corp., New York, New York (1980); and Lerner, Yale J. Biol. Med.,
54:387-402 (1981)). Moreover, the ordinarily skilled worker will appreciate that
there are many variations of such methods that also would be useful.

As an alternative to preparing monoclonal antibody-secreting hybridomas, a
monoclonal antibody to an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS),
or HDRP(ANLS) polypeptide of the invention can be identified and isolated by
screening a recombinant combinatorial immunoglobulin library (e.g., an antibody
phage display library) with the polypeptide to thereby isolate immunoglobulin
library members that bind the polypeptide. Kits for generating and screening phage
display libraries are commercially available (e.g., the Pharmacia Recombinant Phage
Antibody System, Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage
Display Kit, Catalog No. 240612). Additionally, examples of methods and reagents
particularly amenable for use in generating and screening antibody display libraries
can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No. WO
92/18619; PCT Publication No. WO 91/17271; PCT Publication No. WO 92/20791;
PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288; PCT
Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT
Publication No. WO 90/02809; Fuchs et al., Bio/Technology, 9:1370-1372 (1991);
Hay et al., Hum. Antibod. Hybridomas, 3:81-85 (1992); Huse et al., Science,
246:1275-1281 (1989); and Griffiths et al., EMBO J., 12:725-734 (1993).

Additionally, recombinant antibodies, such as chimeric and humanized
monoclonal antibodies, comprising both human and non-human portions, which can
be made using standard recombinant DNA techniques, are within the scope of the
invention. Such chimeric and humanized monoclonal antibodies can be produced by
recombinant DNA techniques known in the art.

In general, antibodies of the invention (e.g., a monoclonal antibody) can be
used to isolate an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide of the invention by standard techniques, such as affinity
chromatography or immunoprecipitation. A polypeptide-specific antibody can
facilitate the purification of natural polypeptide from cells and of recombinantly
produced polypeptide expressed in host cells. Moreover, an antibody specific for an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide of the invention can be used to detect the polypeptide (e.g., in a cellular
lysate, cell supernatant, or tissue sample) in order to evaluate the abundance and
pattern of expression of the polypeptide.

The antibodies of the present invention can also be used diagnostically to
monitor protein levels in tissue as part of a clinical testing procedure, e.g., to
determine the efficacy of a given treatment regimen. Detection can be
facilitated by coupling the antibody to a detectable substance. Examples of
detectable substances include various enzymes, prosthetic groups, fluorescent
materials, luminescent materials, bioluminescent materials, and radioactive
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline
phosphatase, β-galactosidase, and acetylcholinesterase; examples of suitable
prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples
of suitable fluorescent materials include umbelliferone, fluorescein, fluorescein
isothiocyanate, rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride and
phycoerythrin; an example of a luminescent material includes luminol; examples of
bioluminescent materials include luciferase, luciferin, and aequorin; and examples of
suitable radioactive material include 125I, 131I, 35S, and 3H.

DIAGNOSTIC AND SCREENING ASSAYS OF THE INVENTION 

The present invention also pertains to diagnostic assays for assessing HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) gene expression, or
for assessing activity of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptides of the invention. In one embodiment, the assays are
used in the context of a biological sample (e.g., blood, serum, cells, tissue) to
thereby determine whether an individual is afflicted with a cell proliferation disease,
an apoptotic disease, or a cell differentiation disease, or is at risk for (has a
predisposition for or a susceptibility to) developing a cell proliferation disease, an
apoptotic disease, or a cell differentiation disease. The invention also provides for
prognostic (or predictive) assays for determining whether an individual is
susceptible to developing a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease. For example, mutations in the HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule can be
assayed in a biological sample. Such assays can be used for prognostic or predictive
purposes to thereby prophylactically treat an individual prior to the onset of
symptoms associated with a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease.

Another aspect of the invention pertains to assays for monitoring the
influence of agents, or candidate compounds (e.g., drugs or other agents) on the
nucleic acid molecule expression or biological activity of polypeptides of the
invention, as well as to assays for identifying candidate compounds that bind to an
HDAC9 polypeptide, an HDAC9a polypeptide, an HDAC9(ANLS) polypeptide, an
HDAC9a(ANLS) polypeptide, or an HDRP(ANLS) polypeptide. These and other
assays and agents are described in further detail in the following sections.

DIAGNOSTIC ASSAYS

HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
nucleic acid molecules, probes, primers, polypeptides, and antibodies to an HDAC9
protein, an HDAC9a protein, an HDAC9(ANLS) protein, an HDAC9a(ANLS) protein, or an
HDRP(ANLS) protein can be used in methods of diagnosis of a susceptibility to, or
likelihood of having, a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease, as well as in kits useful for diagnosis of a susceptibility to a
cell proliferation disease, an apoptotic disease, or a cell differentiation disease.

In one embodiment of the invention, diagnosis of a decreased susceptibility
to a cell proliferation disease, an apoptotic disease, or a cell differentiation disease is
made by detecting a polymorphism in HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS). The polymorphism can be a mutation in
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), such as the
insertion or deletion of a single nucleotide, or of more than one nucleotide, resulting
in a frame shift mutation; the change of at least one nucleotide, resulting in a change
in the encoded amino acid; the change of at least one nucleotide, resulting in the
generation of a premature stop codon; the deletion of several nucleotides, resulting
in a deletion of one or more amino acids encoded by the nucleotides; the insertion of
one or several nucleotides, such as by unequal recombination or gene conversion,
resulting in an interruption of the coding sequence of the gene; duplication of all or a
part of the gene; transposition of all or a part of the gene; rearrangement of all or a
part of the gene; or a change in the expression pattern of the various HDAC9
isoforms. More than one such mutation may be present in a single nucleic acid
molecule.

Such sequence changes cause a mutation in the polypeptide encoded by
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). For
example, if the mutation is a frame shift mutation, the frame shift can result in a
change in the encoded amino acids, and/or can result in the generation of a
premature stop codon, causing generation of a truncated polypeptide. Alternatively,
a polymorphism associated with a decreased susceptibility to a cell proliferation
disease, an apoptotic disease, or a cell differentiation disease can be a synonymous
mutation in one or more nucleotides (i.e., a mutation that does not result in a change
in the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide). Such a polymorphism may alter sites, affect the stability or transport
of mRNA, or otherwise affect the transcription or translation of the nucleic acid
molecule. HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
that has any of the mutations described above is referred to herein as a "mutant
nucleic acid molecule."
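
The effect of the mutation classes described above on an encoded polypeptide can be illustrated with a minimal Python sketch. The 21-nucleotide reference sequence is a hypothetical toy open reading frame and is not taken from any of SEQ ID NO: 1 through SEQ ID NO: 10; the helper names (translate, substitute, delete) are likewise assumptions introduced only for illustration. The sketch relies only on the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate a coding strand from its first base until the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def substitute(seq, index, base):
    """Return seq with the base at 0-based index replaced."""
    return seq[:index] + base + seq[index + 1:]


def delete(seq, index):
    """Return seq with the base at 0-based index removed."""
    return seq[:index] + seq[index + 1:]


# Hypothetical toy reading frame: ATG GCT GAA CTG AAA GGT TAA
REFERENCE = "ATGGCTGAACTGAAAGGTTAA"

variants = {
    "reference": REFERENCE,
    "synonymous (GCT to GCC)": substitute(REFERENCE, 5, "C"),
    "missense (GAA to GAT)": substitute(REFERENCE, 8, "T"),
    "premature stop (AAA to TAA)": substitute(REFERENCE, 12, "T"),
    "frame shift (one base deleted)": delete(REFERENCE, 5),
}

for label, sequence in variants.items():
    print(f"{label}: {translate(sequence)}")
```

In this toy example the synonymous change leaves the polypeptide unaltered, the single-base deletion shifts the reading frame so that a stop codon is reached early, and the substitution in the fifth codon truncates the polypeptide directly.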

In a first method of diagnosing a decreased susceptibility to a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease,
hybridization methods, such as Southern analysis, Northern analysis, or in situ
hybridizations, can be used (see Ausubel, et al., supra). For example, a biological
sample from a test subject (a "test sample") of genomic DNA, RNA, or cDNA, is
obtained from an individual suspected of having, being susceptible to or predisposed
for, or carrying a defect for, a cell proliferation disease, an apoptotic disease, or a
cell differentiation disease (the "test individual"). The individual can be an adult,
child, or fetus. The test sample can be from any source that contains genomic DNA,
such as a blood sample, sample of amniotic fluid, sample of cerebrospinal fluid, or
tissue sample from skin, muscle, buccal or conjunctival mucosa, placenta,
gastrointestinal tract, or other organs. A test sample of DNA from fetal cells or
tissue can be obtained by appropriate methods, such as by amniocentesis or
chorionic villus sampling. The DNA, RNA, or cDNA sample is then examined to
determine whether a polymorphism in HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) is present, and/or to determine which variant(s)
encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
is present. The presence of the polymorphism or variant(s) can be indicated by
hybridization of the gene in the genomic DNA, RNA, or cDNA to a nucleic acid
probe. A "nucleic acid probe," as used herein, can be a DNA probe or an RNA
probe; the nucleic acid probe can contain at least one polymorphism in HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or can contain a nucleic
acid encoding a particular variant of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS). The probe can be any of the nucleic acid
molecules described above (e.g., the entire nucleic acid molecule, a fragment, a
vector comprising the gene, a probe, or primer, etc.).

To diagnose a decreased susceptibility to a cell proliferation disease, an
apoptotic disease, or a cell differentiation disease, a hybridization sample is formed
by contacting the test sample containing HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS), with at least one nucleic acid probe. A preferred
probe for detecting mRNA or genomic DNA is a labeled nucleic acid probe capable
of hybridizing to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) mRNA or genomic DNA sequences described herein. The nucleic
acid probe can be, for example, a full-length nucleic acid molecule, or a portion
thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250, or 500
nucleotides in length and sufficient to specifically hybridize under stringent
conditions to appropriate mRNA or genomic DNA. For example, the nucleic acid
probe can be all or a portion of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ
ID NO: 7, SEQ ID NO: 9, or the complement of SEQ ID NO: 1, SEQ ID NO: 3,
SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9; or can be a nucleic acid molecule
encoding all or a portion of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID
NO: 8, or SEQ ID NO: 10. Other suitable probes for use in the diagnostic assays of
the invention are described above (see, e.g., probes and primers discussed under the
heading, "Nucleic Acids of the Invention").

The hybridization sample is maintained under conditions that are sufficient to
allow specific hybridization of the nucleic acid probe to HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). "Specific hybridization," as
used herein, indicates exact hybridization (e.g., with no mismatches). Specific
hybridization can be performed under high stringency conditions or moderate
stringency conditions, for example, as described above. In a particularly preferred
embodiment, the hybridization conditions for specific hybridization are high
stringency.

Specific hybridization, if present, is then detected using standard methods. If
specific hybridization occurs between the nucleic acid probe and HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in the test sample, then HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) has the
polymorphism, or is the variant, that is present in the nucleic acid probe. More than
one nucleic acid probe can also be used concurrently in this method. Specific
hybridization of any one of the nucleic acid probes is indicative of a polymorphism
in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), or of the
presence of a particular variant encoded by HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS), and is therefore diagnostic for a decreased
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease.

In Northern analysis (see Current Protocols in Molecular Biology, Ausubel,
et al., supra), the hybridization methods described above are used to identify the
presence of a polymorphism or of a particular variant, associated with a decreased
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease. For Northern analysis, a test sample of RNA is obtained
from the individual by appropriate means. Specific hybridization of a nucleic acid
probe, as described above, to RNA from the individual is indicative of a
polymorphism in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), or of the presence of a particular variant encoded by HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), and is therefore
diagnostic for a decreased susceptibility to a cell proliferation disease, an apoptotic
disease, or a cell differentiation disease.

For representative examples of use of nucleic acid probes, see, for example, 
U.S. Patent Nos. 5,288,611 and 4,851,330. 

Alternatively, a peptide nucleic acid (PNA) probe can be used instead of a
nucleic acid probe in the hybridization methods described above. PNA is a DNA
mimic having a peptide-like, inorganic backbone, such as N-(2-aminoethyl)glycine
units, with an organic base (A, G, C, T, or U) attached to the glycine nitrogen via a
methylene carbonyl linker (see, for example, Nielsen et al., Bioconjugate Chemistry,
5 (1994), American Chemical Society, p. 1 (1994)). The PNA probe can be
designed to specifically hybridize to a gene having a polymorphism associated with
a susceptibility to a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease. Hybridization of the PNA probe to HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) is diagnostic for a decreased
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease.

In another method of the invention, mutation analysis by restriction digestion
can be used to detect a mutant nucleic acid molecule, or nucleic acid molecules
containing a polymorphism(s), if the mutation or polymorphism in the gene results
in the creation or elimination of a restriction site. A test sample containing genomic
DNA is obtained from the individual. Polymerase chain reaction (PCR) can be used
to amplify HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
(and, if necessary, the flanking sequences) in the test sample of genomic DNA from
the test individual. RFLP analysis is conducted as described (see Current Protocols
in Molecular Biology, supra). The digestion pattern of the relevant DNA fragment
indicates the presence or absence of the mutation or polymorphism in HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), and therefore
indicates the presence or absence of this decreased susceptibility to a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease.
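
How a polymorphism that eliminates a restriction site changes the digestion pattern can be sketched as follows. This is a minimal illustration under stated assumptions: the two amplicon sequences are hypothetical toy sequences, the function name digest is introduced here for illustration, and EcoRI (recognition site GAATTC, cut after the first G) is used only as a familiar example enzyme. Nothing here implies that any HDAC9 polymorphism affects an EcoRI site.

```python
def digest(sequence, site="GAATTC", cut_offset=1):
    """Return the fragment lengths produced by cutting a linear sequence.

    The sequence is cut cut_offset bases into every occurrence of site
    (for EcoRI, between the G and the first A of GAATTC).
    """
    fragment_lengths = []
    start = 0
    position = sequence.find(site)
    while position != -1:
        cut = position + cut_offset
        fragment_lengths.append(cut - start)
        start = cut
        position = sequence.find(site, position + 1)
    fragment_lengths.append(len(sequence) - start)
    return fragment_lengths


# Hypothetical amplified fragments: the variant allele carries a single
# base change (GAATTC to GAGTTC) that eliminates the restriction site.
reference_amplicon = "ACGTACGTGAATTCTTGACCA"
variant_amplicon = "ACGTACGTGAGTTCTTGACCA"

print("reference fragments:", digest(reference_amplicon))
print("variant fragments:  ", digest(variant_amplicon))
```

The reference allele yields two fragments and the variant allele yields a single uncut fragment, which is the difference in banding pattern that RFLP analysis reads out.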

Sequence analysis can also be used to detect specific polymorphisms in
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). A test
sample of DNA or RNA is obtained from the test individual. PCR or other
appropriate methods can be used to amplify the nucleic acid molecule, and/or its
flanking sequences, if desired. The sequence of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS), or a fragment of any of
those nucleic acid molecules, or an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) cDNA or a fragment of any of those cDNAs, or
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) mRNA
or a fragment of any of those mRNAs, is determined, using standard methods. The
sequence of the above gene, gene fragment, cDNA, cDNA fragment, mRNA, or
mRNA fragment is compared with the known nucleic acid sequence of the nucleic
acid molecule, cDNA (e.g., SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID
NO: 7, SEQ ID NO: 9, or a nucleic acid sequence encoding the protein of SEQ ID
NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, or a fragment
thereof) or mRNA, as appropriate. The presence of a polymorphism in HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) indicates that the
individual has a decreased susceptibility to a cell proliferation disease, an apoptotic
disease, or a cell differentiation disease.
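
The comparison step of sequence analysis can be sketched in a few lines of Python. The sketch assumes the determined sequence has already been aligned to the known sequence and has the same length, so it reports substitutions only; insertions and deletions would require a proper alignment step that is not shown. The sequences and the function name find_substitutions are hypothetical.

```python
def find_substitutions(reference, test):
    """List (1-based position, reference base, test base) for each mismatch.

    Assumes the two sequences are pre-aligned and of equal length.
    """
    if len(reference) != len(test):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [
        (index + 1, ref_base, test_base)
        for index, (ref_base, test_base) in enumerate(zip(reference, test))
        if ref_base != test_base
    ]


# Hypothetical toy sequences, not taken from the sequence listing.
known_sequence = "ATGGCTGAACTGAAAGGTTAA"
determined_sequence = "ATGGCCGAACTGAAAGGTTAA"

print(find_substitutions(known_sequence, determined_sequence))
```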

Allele-specific oligonucleotides can also be used to detect the presence of a
polymorphism in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), through the use of dot-blot hybridization of amplified
oligonucleotides with allele-specific oligonucleotide (ASO) probes (see, for
example, Saiki et al., Nature (London) 324:163-166 (1986)). An "allele-specific
oligonucleotide" (also referred to herein as an "allele-specific oligonucleotide
probe") is an oligonucleotide of approximately 10-50 base pairs, preferably
approximately 15-30 base pairs, that specifically hybridizes to HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), and that contains a
polymorphism associated with a decreased susceptibility to a cell proliferation
disease, an apoptotic disease, or a cell differentiation disease. An allele-specific
oligonucleotide probe that is specific for particular polymorphisms in HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) can be prepared,
using standard methods (see Current Protocols in Molecular Biology, supra).
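
The notion of an allele-specific probe that hybridizes exactly (with no mismatches) to one allele but not to the other can be modeled, in a deliberately simplified way, as an exact reverse-complement match. The target sequences, the 15-nucleotide probe, and the function names below are hypothetical; real hybridization depends on stringency conditions that this sketch does not model.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence."""
    return sequence.translate(COMPLEMENT)[::-1]


def hybridizes_exactly(probe, target):
    """True if the probe can pair with the target strand with no mismatches."""
    return reverse_complement(probe) in target


# Hypothetical amplified target strands differing at a single position.
reference_target = "TTGACCTGAAGCTAGGCTTACGGA"
variant_target = reference_target[:11] + "T" + reference_target[12:]

# A 15-nucleotide allele-specific probe complementary to the variant allele,
# spanning the polymorphic position.
aso_probe = reverse_complement(variant_target[4:19])

print("probe:", aso_probe)
print("binds variant allele:  ", hybridizes_exactly(aso_probe, variant_target))
print("binds reference allele:", hybridizes_exactly(aso_probe, reference_target))
```

Under this exact-match model the probe reports the variant allele and ignores the reference allele, which differs from it by one base within the probed window.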

To identify polymorphisms in the gene that are associated with a decreased
susceptibility to a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease, a test sample of DNA is obtained from the individual. PCR
can be used to amplify all or a fragment of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS), and its flanking sequences. The DNA
containing the amplified HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) (or a fragment of any of those genes) is dot-blotted, using standard
methods (see Current Protocols in Molecular Biology, supra), and the blot is
contacted with the oligonucleotide probe. The presence of specific hybridization of
the probe to the amplified HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) is then detected. Specific hybridization of an allele-specific
oligonucleotide probe to DNA from the individual is indicative of a polymorphism
in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), and is
therefore indicative of a decreased susceptibility to a cell proliferation disease, an
apoptotic disease, or a cell differentiation disease.

In another embodiment, arrays of oligonucleotide probes that are
complementary to target nucleic acid sequence segments from an individual can be
used to identify polymorphisms in HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS). For example, in one embodiment, an
oligonucleotide array can be used. Oligonucleotide arrays typically comprise a
plurality of different oligonucleotide probes that are coupled to a surface of a
substrate in different known locations. These oligonucleotide arrays, also described
as "GENECHIPS™," have been generally described in the art, for example, in U.S.
Patent No. 5,143,854 and PCT Publication Nos. WO 90/15070 and WO 92/10092.
These arrays can generally be produced using mechanical synthesis methods or light
directed synthesis methods that incorporate a combination of photolithographic
methods and solid phase oligonucleotide synthesis methods. See Fodor et al.,
Science, 251:767-777 (1991); Pirrung et al., U.S. Patent No. 5,143,854; PCT
Publication No. WO 90/15070; Fodor et al., PCT Publication No. WO 92/10092;
and U.S. Patent No. 5,424,186, the entire teachings of each of which are
incorporated by reference herein. Techniques for the synthesis of these arrays using
mechanical synthesis methods are described in, e.g., U.S. Patent No. 5,384,261, the
entire teachings of which are incorporated by reference herein.

Once an oligonucleotide array is prepared, a nucleic acid of interest is
hybridized to the array and scanned for polymorphisms. Hybridization and scanning
are generally carried out by methods described herein and also in, e.g., Published
PCT Application Nos. WO 92/10092 and WO 95/11995, and U.S. Patent No.
5,424,186, the entire teachings of which are incorporated by reference herein. In
brief, a target nucleic acid sequence that includes one or more previously identified
polymorphic markers is amplified by well known amplification techniques, e.g.,
PCR. Typically, this involves the use of primer sequences that are complementary
to the two strands of the target sequence both upstream and downstream from the
polymorphism. Asymmetric PCR techniques may also be used. Amplified target,
generally incorporating a label, is then hybridized with the array under appropriate
conditions. Upon completion of hybridization and washing of the array, the array is
scanned to determine the position on the array to which the target sequence
hybridizes. The hybridization data obtained from the scan is typically in the form of
fluorescence intensities as a function of location on the array.
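
One way such scan data might be interpreted is sketched below. The layout is an assumption made purely for illustration and is not described in the text: each interrogated sequence position is given four probes at known array locations, one per candidate base, and the base whose probe gives the strongest signal is called unless the runner-up is too close. The intensity values, position labels, and the min_ratio threshold are all hypothetical.

```python
def call_base(intensities, min_ratio=2.0):
    """Call the base with the strongest signal, or "N" if the call is ambiguous.

    intensities maps each candidate base to its fluorescence intensity.
    """
    ranked = sorted(intensities.items(), key=lambda item: item[1], reverse=True)
    (best_base, best), (_, second) = ranked[0], ranked[1]
    if second > 0 and best / second < min_ratio:
        return "N"
    return best_base


def call_positions(scan, layout, min_ratio=2.0):
    """Group intensities by interrogated position and call a base for each."""
    by_position = {}
    for location, intensity in scan.items():
        position, base = layout[location]
        by_position.setdefault(position, {})[base] = intensity
    return {pos: call_base(signal, min_ratio) for pos, signal in by_position.items()}


# Hypothetical layout: array row r interrogates one sequence position,
# and columns 0-3 hold the probes for A, C, G, and T respectively.
PROBE_LAYOUT = {
    (row, col): (position, base)
    for row, position in enumerate((101, 202))
    for col, base in enumerate("ACGT")
}

# Hypothetical fluorescence intensities keyed by (row, column) location.
scan = {
    (0, 0): 120.0, (0, 1): 2300.0, (0, 2): 95.0, (0, 3): 110.0,
    (1, 0): 900.0, (1, 1): 80.0, (1, 2): 850.0, (1, 3): 70.0,
}

print(call_positions(scan, PROBE_LAYOUT))
```

In the second row two probes give comparable signals, so the sketch declines to make a single-base call; in practice such a pattern might reflect a heterozygous sample or cross-hybridization and would need further analysis.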

Although primarily described in terms of a single detection block, e.g., for
detection of a single polymorphism, arrays can include multiple detection blocks,
and thus be capable of analyzing multiple, specific polymorphisms. In alternate
arrangements, it will generally be understood that detection blocks may be grouped
within a single array or in multiple, separate arrays so that varying, optimal
conditions may be used during the hybridization of the target to the array. For
example, it may often be desirable to provide for the detection of those
polymorphisms that fall within G-C rich stretches of a genomic sequence, separately
from those falling in A-T rich segments. This allows for the separate optimization
of hybridization conditions for each situation.
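
The reason G-C rich and A-T rich probes benefit from separate conditions can be illustrated numerically. The sketch below groups probes by G-C fraction and estimates a melting temperature with the Wallace rule, Tm = 2(A+T) + 4(G+C) in degrees Celsius, a rough approximation intended for short oligonucleotides. The probe sequences, the 0.5 cutoff, and the function names are hypothetical and are not taken from the text.

```python
def gc_fraction(sequence):
    """Fraction of bases in the sequence that are G or C."""
    return (sequence.count("G") + sequence.count("C")) / len(sequence)


def wallace_tm(sequence):
    """Rough melting temperature in degrees Celsius by the Wallace rule."""
    at_count = sequence.count("A") + sequence.count("T")
    gc_count = sequence.count("G") + sequence.count("C")
    return 2 * at_count + 4 * gc_count


def assign_block(sequence, gc_cutoff=0.5):
    """Assign a probe to a detection block by G-C content (cutoff is illustrative)."""
    return "GC-rich" if gc_fraction(sequence) >= gc_cutoff else "AT-rich"


# Hypothetical 15-nucleotide probes.
probes = ["GCGCCGGCTGCGGCC", "ATTATAAGTTAATAT"]

for probe in probes:
    print(probe, assign_block(probe), f"estimated Tm {wallace_tm(probe)} C")
```

Two probes of identical length differ substantially in estimated melting temperature, so a single hybridization temperature would be either too permissive for one or too stringent for the other.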

Additional descriptions of the use of oligonucleotide arrays for detection of
polymorphisms can be found, for example, in U.S. Patent Nos. 5,858,659 and
5,837,832, the entire teachings of which are incorporated by reference herein.

Other methods of nucleic acid analysis can be used to detect polymorphisms
in HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or
variants encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS). Representative methods include direct manual sequencing (Church
and Gilbert, Proc. Natl. Acad. Sci. USA 81:1991-1995 (1988); Sanger et al., Proc.
Natl. Acad. Sci. 74:5463-5467 (1977); Beavis et al., U.S. Patent No. 5,288,644);
automated fluorescent sequencing; single-stranded conformation polymorphism
assays (SSCP); clamped denaturing gel electrophoresis (CDGE); denaturing gradient
gel electrophoresis (DGGE) (Sheffield et al., Proc. Natl. Acad. Sci. USA 86:
232-236 (1991)); mobility shift analysis (Orita et al., Proc. Natl. Acad. Sci. USA 86:
2766-2770 (1989)); restriction enzyme analysis (Flavell et al., Cell 15:25 (1978);
Geever, et al., Proc. Natl. Acad. Sci. USA 78:5081 (1981)); heteroduplex analysis;
chemical mismatch cleavage (CMC) (Cotton et al., Proc. Natl. Acad. Sci. USA 85:
4397-4401 (1985)); RNase protection assays (Myers et al., Science 230:1242
(1985)); use of polypeptides that recognize nucleotide mismatches, such as E. coli
mutS protein; and allele-specific PCR.

In another embodiment of the invention, diagnosis of a susceptibility to a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease can also
be made by examining the level of an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) nucleic acid, for example, using in situ
hybridization techniques known to one skilled in the art, or by examining the level of
expression, activity, and/or composition of an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide, by a variety of methods, including
enzyme linked immunosorbent assays (ELISAs), Western blots,
immunoprecipitations, immunohistochemistry, and immunofluorescence. A test
sample from an individual is assessed for the presence of an alteration in the level of
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic
acid or in the expression and/or an alteration in composition of the polypeptide
encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS),
or for the presence of a particular variant encoded by HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). An alteration in expression of a
polypeptide encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) can be, for example, an alteration in the quantitative polypeptide
expression (i.e., the amount of polypeptide produced); an alteration in the
composition of a polypeptide encoded by HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS); or an alteration in the qualitative polypeptide
expression (e.g., expression of a mutant HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide or variant thereof). In a preferred
embodiment, diagnosis of a susceptibility to a cell proliferation disease, an apoptotic
disease, or a cell differentiation disease is made by detecting a particular variant
encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
or a particular pattern of variants. Preferably, increased levels of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or increased expression or
activity of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, relative to a control sample, for example, a sample
known not to be associated with a cell proliferation disease, an apoptotic disease, or
a cell differentiation disease, indicates an increased susceptibility or likelihood that
the individual has a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease. Alternatively, decreased levels of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or decreased expression or
activity of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, relative to a control sample, for example, a sample
known not to be associated with a cell proliferation disease, an apoptotic disease, or
a cell differentiation disease, indicates a decreased susceptibility or likelihood that
the individual has a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease.

Both quantitative and qualitative alterations can also be present. An
"alteration" or "modulation" in the polypeptide expression, activity, or composition,
as used herein, refers to an alteration in expression or composition in a test sample,
as compared with the expression or composition of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a control
sample. A control sample is a sample that corresponds to the test sample (e.g., is
from the same type of cells), and is from an individual who is not affected by a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease. An
alteration in the expression or composition of the polypeptide in the test sample, as
compared with the control sample, is indicative of a decreased susceptibility to a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease.
Similarly, the presence of one or more different variants in the test sample, or the
presence of significantly different amounts of different variants in the test sample, as
compared with the control sample, is indicative of a decreased susceptibility to a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease.

It is understood that alterations or modulations in polypeptide expression or
function can occur in varying degrees. For example, an alteration or modulation in
expression can be an increase, for example, by at least 1.5-fold to 2-fold, at least
3-fold, or at least 5-fold, relative to the control. Alternatively, the alteration or
modulation in polypeptide expression can be a decrease, for example, by at least
10%, at least 40%, 50%, or 75%, or by at least 90%, relative to the control.
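
The degrees of alteration described above can be illustrated with a short calculation. The following Python sketch is offered only as an illustration: the function name `classify_alteration` and the choice of the lowest example thresholds (a 1.5-fold increase, a 10% decrease) as cut-offs are assumptions made for this example and are not part of the disclosed methods.

```python
def classify_alteration(test_level: float, control_level: float) -> str:
    """Describe a test/control expression ratio using example thresholds.

    Assumption for illustration: the smallest example values in the text
    (at least 1.5-fold increase, at least 10% decrease) are used as cut-offs.
    """
    if control_level <= 0:
        raise ValueError("control_level must be positive")
    ratio = test_level / control_level
    if ratio >= 1.5:
        # Increases are reported as fold change relative to the control.
        return f"increase ({ratio:.2f}-fold)"
    decrease_pct = (1.0 - ratio) * 100.0
    if decrease_pct >= 10.0:
        # Decreases are reported as percent reduction relative to the control.
        return f"decrease ({decrease_pct:.0f}%)"
    return "no alteration at these thresholds"
```

For example, a test level of 3.0 against a control level of 1.0 would be reported as a 3-fold increase, and a test level of 0.5 against the same control as a 50% decrease.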

Various means of examining expression or composition of the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide can be
used, including spectroscopy, colorimetry, electrophoresis, isoelectric focusing, and
immunoassays (e.g., David et al., U.S. Patent No. 4,376,110) such as
immunoblotting (see also Ausubel et al., supra; particularly chapter 10). For
example, in one embodiment, an antibody capable of binding to the polypeptide
(e.g., as described above), preferably an antibody with a detectable label, can be
used. Antibodies can be polyclonal, or more preferably, monoclonal. An intact
antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be used. The term
"labeled," with regard to the antibody, is intended to encompass direct labeling of
the antibody by coupling (i.e., physically linking) a detectable substance to the
antibody, as well as indirect labeling of the antibody by reacting it with another
reagent that is directly labeled. An example of indirect labeling is detection of a
primary antibody using a fluorescently labeled secondary antibody.

Western blotting analysis, using an antibody as described above that
specifically binds to a mutant HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or an antibody that specifically
binds to a non-mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, or an antibody that specifically binds to a particular
variant encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), can be used to identify the presence in a test sample of a particular
variant of a polypeptide encoded by a polymorphic or mutant HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), or the absence in a test sample
of a particular variant or of a polypeptide encoded by a non-polymorphic or
non-mutant gene. The presence of a polypeptide encoded by a polymorphic or
mutant gene, or the absence of a polypeptide encoded by a non-polymorphic or
non-mutant gene, is diagnostic for a decreased susceptibility to a cell proliferation
disease, an apoptotic disease, or a cell differentiation disease, as is the presence (or
absence) of particular variants encoded by the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule.

In one embodiment of this method, the level or amount of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a test
sample is compared with the level or amount of the HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a control
sample. A level or amount of the polypeptide in the test sample that is higher or
lower than the level or amount of the polypeptide in the control sample, such that the
difference is statistically significant, is indicative of an alteration in the expression of
the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide, and is diagnostic for a decreased susceptibility to a cell proliferation
disease, an apoptotic disease, or a cell differentiation disease.
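
The text does not prescribe a particular statistical test for deciding whether a difference between test and control levels is statistically significant. As one possible illustration only, the Python sketch below uses a two-sided permutation test on the difference of means. The function name `permutation_p_value`, the default number of permutations, and the fixed seed are assumptions made for this example.

```python
import random
from statistics import mean


def permutation_p_value(test, control, n_permutations=5000, seed=0):
    """Two-sided permutation test on the difference of group means.

    Illustrative only: the specification does not name a statistical test.
    Returns an estimated p-value for the observed test/control difference.
    """
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    observed = abs(mean(test) - mean(control))
    pooled = list(test) + list(control)
    n_test = len(test)
    hits = 0
    for _ in range(n_permutations):
        # Reassign measurements to the two groups at random.
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:n_test]) - mean(pooled[n_test:]))
        if diff >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_permutations + 1)
```

A small p-value (for example, below a pre-chosen threshold such as 0.05) would correspond to a level in the test sample that differs from the control by a statistically significant amount.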

Alternatively, the composition of the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide in a test sample is compared with
the composition of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide in a control sample. A difference in the composition of
the polypeptide in the test sample, as compared with the composition of the
polypeptide in the control sample (e.g., the presence of different variants), is
diagnostic for a decreased susceptibility to a cell proliferation disease, an apoptotic
disease, or a cell differentiation disease. In another embodiment, both the level or
amount and the composition of the polypeptide can be assessed in the test sample
and in the control sample. A difference in the amount or level of the polypeptide in
the test sample, compared to the control sample; a difference in composition in the
test sample, compared to the control sample; or both a difference in the amount or
level, and a difference in the composition, is indicative of a decreased susceptibility
to a cell proliferation disease, an apoptotic disease, or a cell differentiation disease.

Kits (e.g., reagent kits) useful in the methods of diagnosis comprise
components useful in any of the methods described herein, including, for example,
hybridization probes or primers as described herein (e.g., labeled probes or primers),
reagents for detection of labeled molecules, restriction enzymes (e.g., for RFLP
analysis), allele-specific oligonucleotides, antibodies that bind to a mutant or to
non-mutant (native) HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, means for amplification of nucleic acids comprising
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS), or means
for analyzing the nucleic acid sequence of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS), or for analyzing the amino acid sequence of an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide, etc.

SCREENING ASSAYS AND AGENTS IDENTIFIED THEREBY

The invention provides methods (also referred to herein as "screening
assays") for identifying the presence of a nucleotide that hybridizes to a nucleic acid
of the invention, as well as for identifying the presence of a polypeptide encoded by
a nucleic acid of the invention. In one embodiment, the presence (or absence) of a
nucleic acid molecule of interest (e.g., a nucleic acid that has significant homology
with a nucleic acid of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS)) in a sample can be assessed by contacting the sample with a nucleic
acid comprising a nucleic acid of the invention (e.g., a nucleic acid having the
sequence of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ
ID NO: 9, which may optionally comprise at least one polymorphism, or the
complement thereof, or a nucleic acid encoding an amino acid having the sequence
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO:
10, or a fragment or variant of such nucleic acids), under stringent conditions as
described above, and then assessing the sample for the presence (or absence) of
hybridization. In a preferred embodiment, high stringency conditions are conditions
appropriate for selective hybridization. In another embodiment, a sample containing
the nucleic acid molecule of interest is contacted with a nucleic acid containing a
contiguous nucleotide sequence (e.g., a primer or a probe as described above) that is
at least partially complementary to a part of the nucleic acid molecule of interest
(e.g., an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
nucleic acid), and the contacted sample is assessed for the presence or absence of
hybridization. In a preferred embodiment, the nucleic acid containing a contiguous
nucleotide sequence is completely complementary to a part of the nucleic acid
molecule of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS).

In any of the above embodiments, all or a portion of the nucleic acid of 
interest can be subjected to amplification prior to performing the hybridization. 

In another embodiment, the presence (or absence) of an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, such as a
polypeptide of the invention or a fragment or variant thereof, in a sample can be
assessed by contacting the sample with an antibody that specifically binds to the
polypeptide of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) (e.g., an antibody such as those described above), and then assessing
the sample for the presence (or absence) of binding of the antibody to the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide.

In another embodiment, the invention provides methods for identifying
agents or compounds (e.g., fusion proteins, polypeptides, peptidomimetics,
prodrugs, receptors, binding agents, antibodies, small molecules or other drugs, or
ribozymes) that alter or modulate (e.g., increase or decrease) the activity of the
polypeptides described herein, or that otherwise interact with the polypeptides
herein. For example, such compounds can be compounds or agents that bind to
polypeptides described herein (e.g., HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) substrates or agents); that have a stimulatory or
inhibitory effect on, for example, activity of polypeptides of the invention; or that
change (e.g., enhance or inhibit) the ability of the polypeptides of the invention to
interact with HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) binding agents; or that alter post-translational processing of the
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide (e.g., agents that alter proteolytic processing to direct the polypeptide
from where it is normally synthesized to another location in the cell, such as the cell
surface; or agents that alter proteolytic processing such that more polypeptide is
released from the cell, etc.). In one example, the binding agent is a cell proliferation
disease binding agent, an apoptotic disease binding agent, or a cell differentiation
disease binding agent. As used herein, by a "cell proliferation disease binding
agent," an "apoptotic disease binding agent," or a "cell differentiation disease
binding agent" is meant an agent as described herein that binds to a polypeptide of
the present invention and modulates a cell proliferation disease, an apoptotic disease,
or a cell differentiation disease. The modulation can be an increase or a decrease in
the severity or progression of the disease. In addition, a cell proliferation disease
binding agent, an apoptotic disease binding agent, or a cell differentiation disease
binding agent includes an agent that binds to a polypeptide that is upstream (earlier)
or downstream (later) of the cell signaling events mediated by a polypeptide of the
present invention, and thereby modulates the overall activity of the signaling
pathway; in turn, the disease state is modulated.

The candidate compound can cause an increase in the activity of the
polypeptide. For example, the activity of the polypeptide can be increased by at least
1.5-fold to 2-fold, at least 3-fold, or at least 5-fold, relative to the control.
Alternatively, the polypeptide activity can be decreased, for example, by at least
10%, at least 20%, 40%, 50%, or 75%, or by at least 90%, relative to the control.

In one embodiment, the invention provides assays for screening candidate
compounds or test agents to identify compounds that bind to or modulate the activity
of polypeptides described herein (or biologically active portions thereof), as well as
agents identifiable by the assays. As used herein, a "candidate compound" or "test
agent" is a chemical molecule, be it naturally-occurring or artificially-derived, and
includes, for example, peptides, proteins, synthesized molecules, for example,
synthetic organic molecules, naturally-occurring molecules, for example, naturally
occurring organic molecules, nucleic acid molecules, and components thereof.

In general, candidate compounds for uses in the present invention may be
identified from large libraries of natural products or synthetic (or semi-synthetic)
extracts or chemical libraries according to methods known in the art. Those skilled
in the field of drug discovery and development will understand that the precise
source of test extracts or compounds is not critical to the screening procedure(s) of
the invention. Accordingly, virtually any number of chemical extracts or compounds
can be screened using the exemplary methods described herein. Examples of such
extracts or compounds include, but are not limited to, plant-, fungal-, prokaryotic- or
animal-based extracts, fermentation broths, and synthetic compounds, as well as
modification of existing compounds. Numerous methods are also available for
generating random or directed synthesis (e.g., semi-synthesis or total synthesis) of
any number of chemical compounds, including, but not limited to, saccharide-,
lipid-, peptide-, and nucleic acid-based compounds. Synthetic compound libraries
are commercially available, e.g., from Brandon Associates (Merrimack, NH) and
Aldrich Chemical (Milwaukee, WI). Alternatively, libraries of natural compounds
in the form of bacterial, fungal, plant, and animal extracts are commercially
available from a number of sources, including Biotics (Sussex, UK), Xenova
(Slough, UK), Harbor Branch Oceanographics Institute (Ft. Pierce, FL), and
PharmaMar, U.S.A. (Cambridge, MA). In addition, natural and synthetically
produced libraries are generated, if desired, according to methods known in the art,
e.g., by standard extraction and fractionation methods. For example, candidate
compounds can be obtained using any of the numerous approaches in combinatorial
library methods known in the art, including: biological libraries; spatially
addressable parallel solid phase or solution phase libraries; synthetic library methods
requiring deconvolution; the "one-bead one-compound" library method; and
synthetic library methods using affinity chromatography selection. The biological
library approach is limited to polypeptide libraries, while the other four approaches
are applicable to polypeptide, non-peptide oligomer or small molecule libraries of
compounds (Lam, Anticancer Drug Des., 12: 145 (1997)). Furthermore, if desired,
any library or compound is readily modified using standard chemical, physical, or
biochemical methods.

In addition, those skilled in the art of drug discovery and development
readily understand that methods for dereplication (e.g., taxonomic dereplication,
biological dereplication, and chemical dereplication, or any combination thereof) or
the elimination of replicates or repeats of materials already known for their activities
should be employed whenever possible.

When a crude extract is found to modulate (i.e., stimulate or inhibit) the
expression and/or activity of the nucleic acids and/or polypeptides of the present
invention, further fractionation of the positive lead extract is necessary to isolate
chemical constituents responsible for the observed effect. Thus, the goal of the
extraction, fractionation, and purification process is the careful characterization and
identification of a chemical entity within the crude extract having an activity that
stimulates or inhibits nucleic acid expression, polypeptide expression, or polypeptide
biological activity. The same assays described herein for the detection of activities
in mixtures of compounds can be used to purify the active component and to test
derivatives thereof. Methods of fractionation and purification of such heterogeneous
extracts are known in the art. If desired, compounds shown to be useful agents for
treatment are chemically modified according to methods known in the art.
Compounds identified as being of therapeutic value may be subsequently analyzed
using animal models for diseases in which it is desirable to alter the activity or
expression of the nucleic acids or polypeptides of the present invention.

In one embodiment, to identify candidate compounds that alter the biological
activity, for example, the enzymatic activity or transcriptional repression activity of
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide, a cell, tissue, cell lysate, tissue lysate, or solution containing or
expressing an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide (e.g., SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ
ID NO: 8, SEQ ID NO: 10, or another variant encoded by HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)), or a fragment or derivative
thereof (as described above), can be contacted with a candidate compound to be
tested under conditions suitable for enzymatic reaction or transcriptional repression
reaction, as described herein.

Alternatively, the polypeptide can be contacted directly with the candidate
compound to be tested. The level (amount) of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) biological activity is assessed (e.g., the level
(amount) of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) biological activity is measured, either directly or indirectly), and is
compared with the level of biological activity in a control (i.e., the level of activity
of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide or active fragment or derivative thereof in the absence of the candidate
compound to be tested, or in the presence of the candidate compound vehicle only).
If the level of the biological activity in the presence of the candidate compound
differs, by an amount that is statistically significant, from the level of the biological
activity in the absence of the candidate compound, or in the presence of the
candidate compound vehicle only, then the candidate compound is a compound that
alters the biological activity of an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide. For example, an increase in the
level of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
enzymatic or transcriptional repression activity relative to a control indicates that
the candidate compound is a compound that enhances (is an agonist of) HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) activity. Similarly,
a decrease in the level of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) enzymatic or transcriptional repression activity relative to a
control indicates that the candidate compound is a compound that inhibits (is an
antagonist of) HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) activity. In another embodiment, the level of biological activity of an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide or derivative or fragment thereof in the presence of the candidate
compound to be tested is compared with a control level that has previously been
established. A level of the biological activity in the presence of the candidate
compound that differs from the control level by an amount that is statistically
significant indicates that the compound alters HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) biological activity.
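
The decision logic of this screening comparison can be summarized in a few lines. The Python sketch below is illustrative only: the function name `classify_compound`, the significance threshold `alpha`, and the assumption that a p-value has already been obtained from some statistical test of the practitioner's choosing are all hypothetical and are not specified in the text.

```python
from statistics import mean


def classify_compound(with_compound, vehicle_only, p_value, alpha=0.05):
    """Label a candidate compound from replicate activity measurements.

    Assumptions for illustration: `p_value` comes from a separately chosen
    statistical test, and `alpha` is a hypothetical significance threshold.
    """
    if not with_compound or not vehicle_only:
        raise ValueError("both groups need at least one measurement")
    treated, control = mean(with_compound), mean(vehicle_only)
    if p_value >= alpha or treated == control:
        # Difference is not statistically significant: no call is made.
        return "no significant alteration"
    # Increased activity relative to control: agonist; decreased: antagonist.
    return "agonist" if treated > control else "antagonist"
```

Replicate activity readings in the presence of the compound are compared with vehicle-only readings, and a call is made only when the difference is statistically significant.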

The present invention also relates to an assay for identifying compounds that
alter the expression of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleic acid molecule (e.g., antisense nucleic acids, fusion proteins,
polypeptides, peptidomimetics, prodrugs, receptors, binding agents, antibodies, small
molecules or other drugs, or ribozymes) that alter (e.g., increase or decrease)
expression (e.g., transcription or translation) of the nucleic acid molecule or that
otherwise interact with the nucleic acids described herein, as well as compounds
identifiable by the assays. For example, a solution containing a nucleic acid
encoding an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide can be contacted with a candidate compound to be tested.
The solution can comprise, for example, cells containing the nucleic acid or cell
lysate containing the nucleic acid; alternatively, the solution can be another solution
that comprises elements necessary for transcription/translation of the nucleic acid.
Cells not suspended in solution can also be employed, if desired. The level and/or
pattern of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
expression (e.g., the level and/or pattern of mRNA or of protein expressed, such as
the level and/or pattern of different variants) is assessed, and is compared with the
level and/or pattern of expression in a control (i.e., the level and/or pattern of
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) expression in
the absence of the candidate compound, or in the presence of the candidate
compound vehicle only). If the level and/or pattern in the presence of the candidate
compound differs, by an amount or in a manner that is statistically significant, from
the level and/or pattern in the absence of the candidate compound, or in the presence
of the candidate compound vehicle only, then the candidate compound is a
compound that alters the expression of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS). Enhancement of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) expression indicates that the
candidate compound is an agonist of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) activity. Similarly, inhibition of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) expression indicates
that the candidate compound is an antagonist of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) activity. In another embodiment, the level
and/or pattern of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide(s) (e.g., different variants) in the presence of the
candidate compound to be tested is compared with a control level and/or pattern that
has previously been established. A level and/or pattern in the presence of the
candidate compound that differs from the control level and/or pattern by an amount
or in a manner that is statistically significant indicates that the candidate compound
alters HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
expression.

In another embodiment of the invention, compounds that alter the expression
of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
nucleic acid molecule or that otherwise interact with the nucleic acids described
herein, can be identified using a cell, cell lysate, or solution containing a nucleic
acid encoding the promoter region of the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) gene operably linked to a reporter gene. After
contact with a candidate compound to be tested, the level of expression of the
reporter gene (e.g., the level of mRNA or of protein expressed) is assessed, and is
compared with the level of expression in a control (i.e., the level of the expression
of the reporter gene in the absence of the candidate compound, or in the presence of
the candidate compound vehicle only). If the level in the presence of the candidate
compound differs, by an amount or in a manner that is statistically significant, from
the level in the absence of the candidate compound, or in the presence of the
candidate compound vehicle only, then the candidate compound is a compound that
alters the expression of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), as indicated by its ability to alter expression of a gene that is
operably linked to the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) gene promoter. Enhancement of the expression of the reporter
indicates that the compound is an agonist of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) activity. Similarly, inhibition of the expression
of the reporter indicates that the compound is an antagonist of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) activity. In another
embodiment, the level of expression of the reporter in the presence of the candidate
compound to be tested is compared with a control level that has previously been
established. A level in the presence of the candidate compound that differs from the
control level by an amount or in a manner that is statistically significant indicates
that the candidate compound alters HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) expression.

Compounds that alter the amounts of different variants encoded by HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) (e.g., a compound
that enhances activity of a first variant, and that inhibits activity of a second variant),
as well as compounds that are agonists of activity of a first variant and antagonists
of activity of a second variant, can easily be identified using the methods
described above.

In other embodiments of the invention, assays can be used to assess the
impact of a candidate compound on the activity of a polypeptide in relation to an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate,
for example, an inhibitor of histone deacetylase activity. These inhibitors fall into
four general classes: 1) short-chain fatty acids (e.g., 4-phenylbutyrate and valproic
acid); 2) hydroxamic acids (e.g., SAHA, Pyroxamide, trichostatin A (TSA),
oxamflatin and CHAPs, such as CHAP1 and CHAP31); 3) cyclic tetrapeptides
(Trapoxin A, Apicidin and Depsipeptide (FK-228, also known as FR901228)); 4)
benzamides (e.g., MS-275); and other compounds such as Scriptaid. Examples of
such assays and compounds can be found in U.S. Patent Nos. 5,369,108, issued on
November 29, 1994, 5,700,811, issued on December 23, 1997, and 5,773,474,
issued on June 30, 1998 to Breslow et al., U.S. Patent Nos. 5,055,608, issued on
October 8, 1991, and 5,175,191, issued on December 29, 1992 to Marks et al., as
well as Yoshida et al., supra; Saito et al., supra; Furumai et al., supra; Komatsu et
al., supra; Su et al., supra; Lee et al., supra; and Suzuki et al., supra, the entire
content of all of which are hereby incorporated by reference.

In one example, a cell or tissue that expresses or contains a compound that
interacts with HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) (herein referred to as an "HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) substrate," which can be a polypeptide or other
molecule that interacts with HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS),
or HDRP(ANLS)) is contacted with HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) in the presence of a candidate compound, and
the ability of the candidate compound to alter the interaction between HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) and the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate is
determined, for example, by assaying activity of the polypeptide. Alternatively, a
cell lysate or a solution containing the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) substrate can be used. A compound that binds
to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or the
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate
can alter the interaction by interfering with, or enhancing the ability of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) to bind to, associate
with, or otherwise interact with the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) substrate.

Determining the ability of the candidate compound to bind to HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or an HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate can be
accomplished, for example, by coupling the candidate compound with a
radioisotope or enzymatic label such that binding of the candidate compound to the
polypeptide can be determined by detecting the labeled compound in a complex. For
example, the candidate compound can be labeled with 125I, 35S, 14C, or 3H,
either directly or indirectly, and the radioisotope detected by direct counting of
radioemission or by scintillation counting. Alternatively, the candidate compound can
be enzymatically labeled with, for example, horseradish peroxidase, alkaline
phosphatase, or luciferase, and the enzymatic label detected by determination of
conversion of an appropriate substrate to product.
It is also within the scope of this invention to determine the ability of a
candidate compound to interact with the polypeptide without the labeling of any of
the interactants. For example, a microphysiometer can be used to detect the
interaction of a candidate compound with HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) or an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) substrate without the labeling of either the
candidate compound, HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), or the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) substrate (McConnell et al. (1992) Science, 257: 1906-1912). As
used herein, a "microphysiometer" (e.g., CYTOSENSOR™) is an analytical
instrument that measures the rate at which a cell acidifies its environment using a
light-addressable potentiometric sensor (LAPS). Changes in this acidification rate
can be used as an indicator of the interaction between ligand and polypeptide.
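
As an illustration of how a change in acidification rate might be quantified, the following minimal Python sketch fits a least-squares slope to pH readings taken over time and compares the rate before and after a candidate compound is added. The function names, the sample readings, and the choice of a simple linear fit are assumptions made for illustration only; they are not taken from the cited reference or from any instrument's software.

```python
from statistics import mean


def acidification_rate(times_s, ph_values):
    """Least-squares slope of pH versus time, returned as a positive
    acidification rate (pH units per second) when the pH is falling."""
    if len(times_s) != len(ph_values) or len(times_s) < 2:
        raise ValueError("need at least two paired readings")
    t_bar = mean(times_s)
    p_bar = mean(ph_values)
    sxx = sum((t - t_bar) ** 2 for t in times_s)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - t_bar) * (p - p_bar) for t, p in zip(times_s, ph_values))
    return -sxy / sxx  # negate the slope so that falling pH gives a positive rate


def fold_change(baseline_rate, treated_rate):
    """Ratio of the treated rate to the baseline rate."""
    return treated_rate / baseline_rate


# Hypothetical readings (seconds, pH); not experimental data.
times = [0, 10, 20, 30]
baseline_ph = [7.40, 7.39, 7.38, 7.37]
treated_ph = [7.40, 7.38, 7.36, 7.34]

baseline = acidification_rate(times, baseline_ph)
treated = acidification_rate(times, treated_ph)
print(f"fold change in acidification rate: {fold_change(baseline, treated):.2f}")
```

A fold change that departs from 1 would suggest that the compound alters the cellular response, subject to appropriate replicate measurements and controls.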

In another embodiment of the invention, assays can be used to identify
polypeptides that interact with one or more HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptides, as described herein. For example,
a yeast two-hybrid system such as that described by Fields and Song (Fields and
Song, Nature 340: 245-246 (1989)) can be used to identify polypeptides that interact
with one or more HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptides. In such a yeast two-hybrid system, vectors are
constructed based on the flexibility of a transcription factor that has two functional
domains (a DNA binding domain and a transcription activation domain). If the two
domains are separated but fused to two different proteins that interact with one
another, transcriptional activation can be achieved, and transcription of specific
markers (e.g., nutritional markers such as His and Ade, or color markers such as
lacZ) can be used to identify the presence of interaction and transcriptional
activation. For example, in the methods of the invention, a first vector is used that
includes a nucleic acid encoding a DNA binding domain and an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, variant, or
fragment or derivative thereof, and a second vector is used that includes a nucleic
acid encoding a transcription activation domain and a nucleic acid encoding a
polypeptide that potentially may interact with the HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, variant, or
fragment or derivative thereof (e.g., an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide substrate or receptor). Incubation
of yeast containing the first vector and the second vector under appropriate
conditions (e.g., mating conditions such as used in the MATCHMAKER™ system
from Clontech) allows identification of colonies that express the markers of
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). These
colonies can be examined to identify the polypeptide(s) that interact with the
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide or fragment or derivative thereof. Such polypeptides may be useful as
compounds that alter the activity or expression of an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, as described
above.



In more than one embodiment of the above assay methods of the present
invention, it may be desirable to immobilize an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or an HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate, or other
components of the assay on a solid support, in order to facilitate separation of
complexed from uncomplexed forms of one or both of the polypeptides, as well as
to accommodate automation of the assay. Binding of a candidate compound to the
polypeptide, or interaction of the polypeptide with a substrate in the presence and
absence of a candidate compound, can be accomplished in any vessel suitable for
containing the reactants. Examples of such vessels include microtitre plates, test
tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein (e.g., a
glutathione-S-transferase fusion protein) can be provided that adds a domain that
allows HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
substrate to be bound to a matrix or other solid support.



In another embodiment, modulators of expression of nucleic acid molecules
of the invention are identified in a method wherein a cell, cell lysate, tissue, tissue
lysate, or solution containing a nucleic acid encoding HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) is contacted with a candidate
compound and the expression of appropriate mRNA or polypeptide (e.g., variant(s))
in the cell, cell lysate, tissue, or tissue lysate, or solution, is determined. The level
of expression of appropriate mRNA or polypeptide(s) in the presence of the
candidate compound is compared to the level of expression of mRNA or
polypeptide(s) in the absence of the candidate compound, or in the presence of the
candidate compound vehicle only. The candidate compound can then be identified
as a modulator of expression based on this comparison. For example, when
expression of mRNA or polypeptide is greater (statistically significantly greater) in
the presence of the candidate compound than in its absence, the candidate
compound is identified as a stimulator or enhancer of the mRNA or polypeptide
expression. Alternatively, when expression of the mRNA or polypeptide is less
(statistically significantly less) in the presence of the candidate compound than in its
absence, the candidate compound is identified as an inhibitor of the mRNA or
polypeptide expression. The level of mRNA or polypeptide expression in the cells
can be determined by methods described herein for detecting mRNA or polypeptide.
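
The comparison described above can be sketched in Python as follows. This is a minimal illustration, assuming replicate expression measurements for compound-treated and vehicle-only samples; the exact two-sided permutation test, the 0.05 significance threshold, the function names, and the sample values are all assumptions chosen for illustration, since the text does not prescribe a particular statistical method.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(treated, control):
    """Exact two-sided permutation test on the difference in group means."""
    pooled = list(treated) + list(control)
    n_treated = len(treated)
    n_control = len(pooled) - n_treated
    observed = abs(mean(treated) - mean(control))
    total = sum(pooled)
    extreme = 0
    count = 0
    # Enumerate every way of relabelling n_treated samples as "treated".
    for idx in combinations(range(len(pooled)), n_treated):
        group_sum = sum(pooled[i] for i in idx)
        diff = abs(group_sum / n_treated - (total - group_sum) / n_control)
        count += 1
        if diff >= observed - 1e-12:  # small tolerance for float rounding
            extreme += 1
    return extreme / count


def classify_modulator(treated, control, alpha=0.05):
    """Label a candidate compound from replicate expression levels."""
    if permutation_p_value(treated, control) >= alpha:
        return "no significant effect"
    return "enhancer" if mean(treated) > mean(control) else "inhibitor"


# Hypothetical relative expression levels; not experimental data.
with_compound = [2.1, 2.4, 2.2, 2.5]
vehicle_only = [1.0, 1.1, 0.9, 1.2]
print(classify_modulator(with_compound, vehicle_only))
```

With only three replicates per group, an exact permutation test cannot reach a p-value below 0.1, so the number of replicates limits what such a comparison can detect.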

This invention further pertains to novel compounds identified by the
above-described screening assays. Accordingly, it is within the scope of this
invention to further use a compound identified as described herein in an appropriate
animal model. For example, a compound identified as described herein (e.g., a
candidate compound that is a modulating compound such as an antisense nucleic
acid molecule, a specific antibody, or a polypeptide substrate) can be used in an
animal model to determine the efficacy, toxicity, or side effects of treatment with
such a compound. Alternatively, a compound identified as described herein can be
used in an animal model to determine the mechanism of action of such a compound.
Furthermore, this invention pertains to uses of novel compounds identified by the
above-described screening assays for treatments as described herein. In addition, a
compound identified as described herein can be used to alter activity of an HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or to
alter expression of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), by contacting the polypeptide or the nucleic acid molecule (or
contacting a cell comprising the polypeptide or the nucleic acid molecule) with the
compound identified as described herein.



PHARMACEUTICAL COMPOSITIONS 

The present invention also pertains to pharmaceutical compositions
comprising nucleic acids described herein, particularly nucleotides encoding the
polypeptides described herein; comprising polypeptides described herein (e.g., SEQ
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10, and/or
other variants encoded by HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS)); and/or comprising a compound that alters (e.g., increases or
decreases) HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
expression or HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide activity as described herein. For instance, a polypeptide,
protein, fragment, fusion protein or prodrug thereof, or a nucleotide or nucleic acid 
construct (vector) comprising a nucleotide of the present invention, a compound that 
alters HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) 
polypeptide activity, a compound that alters HDAC9, HDAC9a, HDAC9(ANLS), 
HDAC9a(ANLS), or HDRP(ANLS) expression, or an HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrate or binding
partner, can be formulated with a physiologically acceptable carrier or excipient to 
prepare a pharmaceutical composition. The carrier and composition can be sterile. 
The formulation should suit the mode of administration. 
Suitable pharmaceutically acceptable carriers include but are not limited to
water, salt solutions (e.g., NaCl), saline, buffered saline, alcohols, glycerol, ethanol,
gum arabic, vegetable oils, benzyl alcohols, polyethylene glycols, gelatin,
carbohydrates such as lactose, amylose or starch, dextrose, magnesium stearate, talc,
silicic acid, viscous paraffin, perfume oil, fatty acid esters, hydroxymethylcellulose,
polyvinyl pyrrolidone, etc., as well as combinations thereof. The pharmaceutical
preparations can, if desired, be mixed with auxiliary agents, e.g., lubricants,
preservatives, stabilizers, wetting agents, emulsifiers, salts for influencing osmotic
pressure, buffers, coloring, flavoring and/or aromatic substances and the like that do
not deleteriously react with the active compounds.

The composition, if desired, can also contain minor amounts of wetting or
emulsifying agents, or pH buffering agents. The composition can be a liquid
solution, suspension, emulsion, tablet, pill, capsule, sustained release formulation,
or powder. The composition can be formulated as a suppository, with traditional
binders and carriers such as triglycerides. Oral formulation can include standard
carriers such as pharmaceutical grades of mannitol, lactose, starch, magnesium
stearate, polyvinyl pyrrolidone, sodium saccharine, cellulose, magnesium carbonate,
etc.

Methods of introduction of these compositions include, but are not limited
to, intradermal, intramuscular, intraperitoneal, intraocular, intravenous,
subcutaneous, topical, oral and intranasal. Other suitable methods of introduction
can also include gene therapy (as described below), rechargeable or biodegradable
devices, particle acceleration devices ("gene guns") and slow release polymeric
devices. The pharmaceutical compositions of this invention can also be
administered as part of a combinatorial therapy with other compounds.

The composition can be formulated in accordance with the routine
procedures as a pharmaceutical composition adapted for administration to human
beings. For example, compositions for intravenous administration typically are
solutions in sterile isotonic aqueous buffer. Where necessary, the composition may
also include a solubilizing agent and a local anesthetic to ease pain at the site of the
injection. Generally, the ingredients are supplied either separately or mixed together
in unit dosage form, for example, as a dry lyophilized powder or water free
concentrate in a hermetically sealed container such as an ampule or sachette
indicating the quantity of active compound. Where the composition is to be
administered by infusion, it can be dispensed with an infusion bottle containing
sterile pharmaceutical grade water, saline or dextrose/water. Where the composition
is administered by injection, an ampule of sterile water for injection or saline can be
provided so that the ingredients may be mixed prior to administration.

For topical application, nonsprayable forms, viscous to semi-solid or solid
forms comprising a carrier compatible with topical application and having a
dynamic viscosity preferably greater than water, can be employed. Suitable
formulations include but are not limited to solutions, suspensions, emulsions,
creams, ointments, powders, enemas, lotions, sols, liniments, salves, aerosols, etc.,
that are, if desired, sterilized or mixed with auxiliary agents, e.g., preservatives,
stabilizers, wetting agents, buffers or salts for influencing osmotic pressure, etc.
The compound may be incorporated into a cosmetic formulation. For topical
application, also suitable are sprayable aerosol preparations wherein the active
ingredient, preferably in combination with a solid or liquid inert carrier material, is
packaged in a squeeze bottle or in admixture with a pressurized volatile, normally
gaseous propellant, e.g., pressurized air.

Compounds described herein can be formulated as neutral or salt forms. 
Pharmaceutically acceptable salts include those formed with free amino groups such 
as those derived from hydrochloric, phosphoric, acetic, oxalic, tartaric acids, etc., 
and those formed with free carboxyl groups such as those derived from sodium,
potassium, ammonium, calcium, ferric hydroxides, isopropylamine, triethylamine, 
2-ethylamino ethanol, histidine, procaine, etc. 

The compounds are administered in a therapeutically effective amount. The 
amount of compounds that will be therapeutically effective in the treatment of a 
particular disorder or condition will depend on the nature of the disorder or 
condition, and can be determined by standard clinical techniques. In addition, in
vitro or in vivo assays may optionally be employed to help identify optimal dosage 
ranges. The precise dose to be employed in the formulation will also depend on the 
route of administration, and the seriousness of the symptoms of a cell proliferation 
disease, an apoptotic disease, or a cell differentiation disease, and should be decided 
according to the judgment of a practitioner and each patient's circumstances. 
Effective doses may be extrapolated from dose-response curves derived from in 
vitro or animal model test systems. 

The invention also provides a pharmaceutical pack or kit comprising one or
more containers filled with one or more of the ingredients of the pharmaceutical
compositions of the invention. Optionally associated with such container(s) can be
a notice in the form prescribed by a governmental agency regulating the
manufacture, use or sale of pharmaceuticals or biological products, which notice
reflects approval by the agency of manufacture, use or sale for human
administration. The pack or kit can be labeled with information regarding mode of
administration, sequence of drug administration (e.g., separately, sequentially or
concurrently), or the like. The pack or kit may also include means for reminding the
patient to take the therapy. The pack or kit can be a single unit dosage of the
combination therapy or it can be a plurality of unit dosages. In particular, the
compounds can be separated, mixed together in any combination, or present in a single
vial or tablet. Compounds assembled in a blister pack or other dispensing means are
preferred. For the purpose of this invention, unit dosage is intended to mean a
dosage that is dependent on the individual pharmacodynamics of each compound
and administered in FDA approved dosages in standard time courses.

METHODS OF THERAPY 

The present invention also pertains to methods of treatment (prophylactic,
diagnostic, and/or therapeutic) for a cell proliferation disease, an apoptotic disease,
or a cell differentiation disease, using an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) therapeutic compound. An "HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) therapeutic
compound" is a compound that alters (e.g., enhances or inhibits) HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide activity and/or
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid
molecule expression, as described herein (e.g., an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) agonist or antagonist).
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
therapeutic compounds can alter HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide activity or nucleic acid molecule
expression by a variety of means, such as, for example, by providing additional
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide or by upregulating the transcription or translation of the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid
molecule; by altering post-translational processing of the HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide; by altering
transcription of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) variants; or by interfering with HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide activity (e.g., by binding to an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide), or by downregulating the transcription or translation of the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid
molecule. Representative HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), 
or HDRP(ANLS) therapeutic compounds include the following: nucleic acids or
fragments or derivatives thereof described herein, particularly nucleotides encoding
the polypeptides described herein and vectors comprising such nucleic acids (e.g., a
nucleic acid molecule, cDNA, and/or RNA, such as a nucleic acid encoding an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide or active fragment or derivative thereof, or an oligonucleotide; for
example, SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID
NO: 9, which may optionally comprise at least one polymorphism, or a nucleic acid
encoding SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID
NO: 10, or fragments or derivatives thereof); polypeptides described herein (e.g.,
SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, SEQ ID NO: 10,
and/or other variants encoded by HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS), or fragments or derivatives thereof); HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) substrates;
peptidomimetics; fusion proteins or prodrugs thereof; antibodies (e.g., an antibody
to a mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, or an antibody to a non-mutant HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide, or an antibody to
a particular variant encoded by HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS), as described above); ribozymes; other small
molecules; and other compounds that alter (e.g., enhance or inhibit) HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid
expression or polypeptide activity, for example, those compounds identified in the
screening methods described herein, or that regulate transcription of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) variants (e.g.,
compounds that affect which variants are expressed, or that affect the amount of
each variant that is expressed). More than one HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) therapeutic compound can be used
concurrently, if desired.
The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) therapeutic compound that is a nucleic acid is used in the treatment
of a cell proliferation disease, an apoptotic disease, or a cell differentiation disease.
The term "treatment," as used herein, refers not only to ameliorating symptoms
associated with the disease, but also to preventing or delaying the onset of the disease,
and also lessening the severity or frequency of symptoms of the disease. The
therapy is designed to alter (e.g., inhibit or enhance), replace or supplement activity
of an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide in an individual. For example, an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) therapeutic compound can be administered in
order to upregulate or increase the expression or availability of the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule
or of specific variants of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS), or, conversely, to downregulate or decrease the expression or
availability of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleic acid molecule or specific variants of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). Upregulation or increasing
expression or availability of a native HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecule or of a particular variant
could interfere with or compensate for the expression or activity of a defective gene
or another variant; downregulation or decreasing expression or availability of a
native HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
nucleic acid molecule or of a particular variant could minimize the expression or
activity of a defective gene or the particular variant and thereby minimize the impact
of the defective gene or the particular variant.

The HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) therapeutic compound(s) are administered in a therapeutically
effective amount (i.e., an amount that is sufficient to treat the disease, such as by
ameliorating symptoms associated with the disease, preventing or delaying the onset
of the disease, and/or also lessening the severity or frequency of symptoms of the
disease). The amount that will be therapeutically effective in the treatment of a
particular individual's disorder or condition will depend on the symptoms and
severity of the disease, and can be determined by standard clinical techniques. In
addition, in vitro or in vivo assays may optionally be employed to help identify
optimal dosage ranges. The precise dose to be employed in the formulation will
also depend on the route of administration, and the seriousness of the disease or
disorder, and should be decided according to the judgment of a practitioner and each
patient's circumstances. Effective doses may be extrapolated from dose-response
curves derived from in vitro or animal model test systems.
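
As a rough illustration of reading an effective dose off a dose-response curve, the following Python sketch estimates the dose that produces a chosen response by interpolating linearly on log10(dose) between the two measured points that bracket that response. The interpolation scheme, the function name, and the example values are assumptions for illustration only; in practice a fitted model and practitioner judgment, as noted above, would determine any actual dose.

```python
import math


def dose_for_response(doses, responses, target):
    """Estimate the dose giving `target` response.

    Assumes `doses` are positive and ascending, and `responses` are
    non-decreasing with dose. Interpolates linearly on log10(dose)
    between the two measured points that bracket the target.
    """
    points = list(zip(doses, responses))
    for (d0, r0), (d1, r1) in zip(points, points[1:]):
        if r0 <= target <= r1 and r1 != r0:
            fraction = (target - r0) / (r1 - r0)
            log_dose = math.log10(d0) + fraction * (math.log10(d1) - math.log10(d0))
            return 10 ** log_dose
    raise ValueError("target response is outside the measured range")


# Hypothetical in vitro data: dose (arbitrary units) versus percent response.
example_doses = [0.1, 1.0, 10.0, 100.0]
example_responses = [5.0, 25.0, 75.0, 95.0]
half_maximal = dose_for_response(example_doses, example_responses, 50.0)
print(f"estimated dose for 50% response: {half_maximal:.2f}")
```

The function deliberately refuses to estimate a dose for responses outside the measured range, since interpolation gives no support for values beyond the data.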

In one embodiment, a nucleic acid of the invention (e.g., a nucleic acid
encoding an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, such as SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5,
SEQ ID NO: 7, or SEQ ID NO: 9, which may optionally comprise at least one
polymorphism, or a nucleic acid that encodes an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide or a variant,
derivative or fragment thereof, such as a nucleic acid encoding the protein of SEQ
ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10) can
be used, either alone or in a pharmaceutical composition as described above. For
example, HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) or
a cDNA encoding an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, either by itself or included within a vector, can be
introduced into cells (either in vitro or in vivo) such that the cells produce native
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide. If desired, cells that have been transformed with the gene or cDNA or
a vector comprising the gene or cDNA can be introduced (or re-introduced) into an
individual affected with the disease. Thus, cells that, in nature, lack native HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) expression and
activity, or have mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) expression and activity, or have expression of a disease-associated
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) variant,
can be engineered to express an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide or an active fragment of an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptide (or a different variant of an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide). In a preferred embodiment,
nucleic acid encoding the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide, or an active fragment or derivative thereof, can be
introduced into an expression vector, such as a viral vector, and the vector can be
introduced into appropriate cells in an animal. Other gene transfer systems,
including viral and nonviral transfer systems, can be used. Alternatively, nonviral
gene transfer methods, such as calcium phosphate coprecipitation; mechanical
techniques (e.g., microinjection); membrane fusion-mediated transfer via liposomes;
or direct DNA uptake, can also be used to introduce the desired nucleic acid
molecule into a cell.

Alternatively, in another embodiment of the invention, a nucleic acid of the
invention; a nucleic acid complementary to a nucleic acid of the invention; or a
portion of such a nucleic acid (e.g., an oligonucleotide as described below), can be
used in "antisense" therapy, in which a nucleic acid (e.g., an oligonucleotide) that
specifically hybridizes to the RNA and/or genomic DNA of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) is administered or generated in
situ. The antisense nucleic acid that specifically hybridizes to the RNA and/or DNA
inhibits expression of the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleic acid molecule, e.g., by inhibiting translation and/or
transcription. Binding of the antisense nucleic acid can be by conventional base pair
complementarity, or, for example, in the case of binding to DNA duplexes, through
specific interaction in the major groove of the double helix.

An antisense construct of the present invention can be delivered, for
example, as an expression plasmid as described above. When the plasmid is
transcribed in the cell, it produces RNA that is complementary to a portion of the
mRNA and/or DNA that encodes an HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide. Alternatively, the antisense
construct can be an oligonucleotide probe which is generated ex vivo and introduced
into cells; it then inhibits expression by hybridizing with the mRNA and/or genomic
DNA of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS). In
one embodiment, the oligonucleotide probes are modified oligonucleotides that are
resistant to endogenous nucleases, e.g., exonucleases and/or endonucleases, thereby
rendering them stable in vivo. Exemplary nucleic acid molecules for use as
antisense oligonucleotides are phosphoramidate, phosphorothioate and
methylphosphonate analogs of DNA (see also U.S. Patent Nos. 5,176,996;
5,264,564; and 5,256,775). Additionally, general approaches to constructing
oligomers useful in antisense therapy are also described, for example, by Van der
Krol et al., Biotechniques 6: 958-976 (1988); and Stein et al., Cancer Res. 48:
2659-2668 (1988). With respect to antisense DNA, oligodeoxyribonucleotides
derived from the translation initiation site, e.g., between the -10 and +10 regions of
an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic
acid sequence, are preferred.
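
The derivation of an antisense oligodeoxyribonucleotide spanning the -10 to +10 region can be sketched in Python as follows. This is a minimal illustration that assumes position +1 is the A of the ATG start codon (with no position 0), so the window covers ten nucleotides upstream of the start codon and the first ten nucleotides of the coding sequence. The function names and the short example sequence are hypothetical and are not drawn from any of SEQ ID NOs: 1, 3, 5, 7, or 9.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def antisense_around_start(sense_dna, start_codon_index, upstream=10, downstream=10):
    """Antisense oligo (5' to 3') covering positions -upstream to +downstream.

    `start_codon_index` is the 0-based index of the A of the ATG start
    codon in `sense_dna` (the coding-strand sequence, 5' to 3').
    """
    sense = sense_dna.upper()
    if sense[start_codon_index:start_codon_index + 3] != "ATG":
        raise ValueError("no ATG at the given start codon index")
    if start_codon_index < upstream or start_codon_index + downstream > len(sense):
        raise ValueError("sequence too short for the requested window")
    window = sense[start_codon_index - upstream:start_codon_index + downstream]
    return reverse_complement(window)


# Hypothetical coding-strand fragment with the ATG start codon at index 12.
example_sense = "TTACGTACGTACATGGCCAAGTCC"
print(antisense_around_start(example_sense, 12))  # ACTTGGCCATGTACGTACGT
```

The returned 20-mer is complementary to the sense strand and therefore to the corresponding mRNA region, reading U for T in the transcript.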
To perform antisense therapy, oligonucleotides (RNA, cDNA or DNA) are
designed that are complementary to mRNA encoding an HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) polypeptide. The antisense
oligonucleotides bind to HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) mRNA transcripts and prevent translation. Absolute
complementarity, although preferred, is not required. A sequence "complementary"
to a portion of an RNA, as referred to herein, indicates that a sequence has sufficient
complementarity to be able to hybridize with the RNA, forming a stable duplex; in
the case of double-stranded antisense nucleic acids, a single strand of the duplex
DNA may thus be tested, or triplex formation may be assayed. The ability to
hybridize will depend on both the degree of complementarity and the length of the
antisense nucleic acid, as described in detail above. Generally, the longer the
hybridizing nucleic acid, the more base mismatches with an RNA it may contain and
still form a stable duplex (or triplex, as the case may be). One skilled in the art can
ascertain a tolerable degree of mismatch by use of standard procedures.
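
To make the notion of partial complementarity concrete, the following Python sketch counts base mismatches between an antisense oligonucleotide and an equal-length target RNA site, aligning the two strands antiparallel. It is an illustration only: it counts Watson-Crick pairs alone (G-U wobble pairs are treated as mismatches), the function names are hypothetical, and the mismatch limit is a placeholder parameter, since the tolerable degree of mismatch is determined empirically and is not specified in the text.

```python
# Allowed (oligo base, RNA target base) pairs; the oligo may be DNA or RNA.
WATSON_CRICK = {("A", "U"), ("T", "A"), ("U", "A"), ("G", "C"), ("C", "G")}


def count_mismatches(antisense_oligo, target_rna):
    """Number of non-Watson-Crick positions in an antiparallel alignment.

    Both sequences are written 5' to 3' and must be the same length.
    """
    if len(antisense_oligo) != len(target_rna):
        raise ValueError("oligo and target site must be the same length")
    oligo = antisense_oligo.upper()
    target_3_to_5 = target_rna.upper()[::-1]  # reverse so the strands pair up
    return sum((o, t) not in WATSON_CRICK for o, t in zip(oligo, target_3_to_5))


def within_mismatch_limit(antisense_oligo, target_rna, max_mismatch_fraction):
    """True if the mismatch fraction does not exceed the given limit."""
    mismatches = count_mismatches(antisense_oligo, target_rna)
    return mismatches / len(antisense_oligo) <= max_mismatch_fraction


# Hypothetical 20-nucleotide target site and two candidate DNA oligos.
target_site = "ACGUACGUACAUGGCCAAGU"
perfect_oligo = "ACTTGGCCATGTACGTACGT"
one_off_oligo = "GCTTGGCCATGTACGTACGT"
print(count_mismatches(perfect_oligo, target_site))  # 0
print(count_mismatches(one_off_oligo, target_site))  # 1
```

Because longer oligonucleotides can tolerate more mismatches, expressing the limit as a fraction of oligo length, rather than as an absolute count, is one simple way to reflect the length dependence described above.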
The oligonucleotides used in antisense therapy can be DNA, RNA, or
chimeric mixtures or derivatives or modified versions thereof, single-stranded or
double-stranded. The oligonucleotides can be modified at the base moiety, sugar
moiety, or phosphate backbone, for example, to improve stability of the molecule,
hybridization, etc. The oligonucleotides can include other appended groups such as
peptides (e.g., for targeting host cell receptors in vivo), or compounds facilitating
transport across the cell membrane (see, e.g., Letsinger et al., Proc. Natl. Acad. Sci.
USA 86: 6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. Sci. USA 84: 648-652
(1987); PCT International Publication No. WO 88/09810) or the blood-brain barrier
(see, e.g., PCT International Publication No. WO 89/10134), or
hybridization-triggered cleavage agents (see, e.g., Krol et al., BioTechniques 6:
958-976 (1988)) or intercalating agents (see, e.g., Zon, Pharm. Res. 5: 539-549
(1988)). To this end, the oligonucleotide may be conjugated to another molecule
(e.g., a peptide, hybridization-triggered cross-linking agent, transport agent, or
hybridization-triggered cleavage agent).

The antisense molecules are delivered to cells that express HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in vivo. A number of
methods can be used for delivering antisense DNA or RNA to cells; e.g., antisense
molecules can be injected directly into the tissue site, or modified antisense
molecules, designed to target the desired cells (e.g., antisense linked to peptides or
antibodies that specifically bind receptors or antigens expressed on the target cell
surface) can be administered systemically. Alternatively, in a preferred
embodiment, a recombinant DNA construct is utilized in which the antisense
oligonucleotide is placed under the control of a strong promoter (e.g., pol III or pol
II). The use of such a construct to transfect target cells in the patient results in the
transcription of sufficient amounts of single stranded RNAs that will form
complementary base pairs with the endogenous HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) transcripts and thereby prevent translation of the
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) mRNA.
For example, a vector can be introduced in vivo such that it is taken up by a cell and
directs the transcription of an antisense RNA. Such a vector can remain episomal or
become chromosomally integrated, as long as it can be transcribed to produce the
desired antisense RNA. Such vectors can be constructed by recombinant DNA
technology methods standard in the art and described above. For example, a
plasmid, cosmid, YAC, or viral vector can be used to prepare the recombinant DNA
construct that can be introduced directly into the tissue site. Alternatively, viral
vectors can be used that selectively infect the desired tissue, in which case
administration may be accomplished by another route (e.g., systemically).
Endogenous HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) expression can also be reduced by inactivating or "knocking out"
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid
sequences or their promoters using targeted homologous recombination (e.g., see
Smithies et al., Nature 317: 230-234 (1985); Thomas and Capecchi, Cell 51:
503-512 (1987); Thompson et al., Cell 56: 313-321 (1989)). For example, a mutant,
non-functional HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) (or a completely unrelated DNA sequence) flanked by DNA
homologous to the endogenous HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) (either the coding regions or regulatory regions
of HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)) can be
used, with or without a selectable marker and/or a negative selectable marker, to
transfect cells that express HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) in vivo. Insertion of the DNA construct, via targeted homologous
recombination, results in inactivation of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS). The recombinant DNA constructs can be
directly administered or targeted to the required site in vivo using appropriate
vectors, as described above. Alternatively, expression of non-mutant HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) can be increased
using a similar method: targeted homologous recombination can be used to insert a
DNA construct comprising a non-mutant, functional HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) (e.g., a gene having SEQ ID
NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9, which
may optionally comprise at least one polymorphism), or a portion thereof, in place
of a mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
in the cell, as described above. In another embodiment, targeted homologous
recombination can be used to insert a DNA construct comprising a nucleic acid that
encodes an HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) polypeptide variant that differs from that present in the cell.



Alternatively, endogenous HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) expression can be reduced by targeting
deoxyribonucleotide sequences complementary to the regulatory region of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) (i.e., the HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) promoter and/or
enhancers) to form triple helical structures that prevent transcription of HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in target cells in the
body. (See generally, Helene, Anticancer Drug Des., 6(6): 569-84 (1991); Helene et
al., Ann. N.Y. Acad. Sci., 660: 27-36 (1992); and Maher, Bioessays 14(12): 807-15
(1992)). Likewise, the antisense constructs described herein, by antagonizing the
normal biological activity of one of the HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) proteins, can be used in the manipulation of
tissue, e.g., tissue differentiation, both in vivo and for ex vivo tissue cultures.
Furthermore, the antisense techniques (e.g., microinjection of antisense molecules,
or transfection with plasmids whose transcripts are antisense with regard to an
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) mRNA or
gene sequence) can be used to investigate the role of HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in developmental events, as
well as the normal cellular function of HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) in adult tissue. Such techniques can be utilized
in cell culture, but can also be used in the creation of transgenic animals.

In yet another embodiment of the invention, other HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) therapeutic compounds as
described herein can also be used in the treatment or prevention of a cell
proliferation disease, an apoptotic disease, or a cell differentiation disease. The
therapeutic compounds can be delivered in a composition, as described above, or by
themselves. They can be administered systemically, or can be targeted to a
particular tissue. The therapeutic compounds can be produced by a variety of
means, including chemical synthesis; recombinant production; and in vivo production
(e.g., in a transgenic animal, such as described in U.S. Patent No. 4,873,316 to
Meade et al.), and can be isolated using standard means such as those described herein.

A combination of any of the above methods of treatment (e.g.,
administration of non-mutant HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptide in conjunction with antisense
therapy targeting mutant HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) mRNA; or administration of a first variant encoded by HDAC9,
HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) in conjunction with
antisense therapy targeting a second variant encoded by HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)) can also be used.

In another embodiment, the invention is directed to HDAC9, HDAC9a,
HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS) nucleic acid molecules and
HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or HDRP(ANLS)
polypeptides for use as a medicament in therapy. For example, the nucleic acid
molecules or polypeptides of the present invention can be used in the treatment of a
cell proliferation disease, an apoptotic disease, or a cell differentiation disease. In
addition, the HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), or
HDRP(ANLS) nucleic acid molecules and HDAC9, HDAC9a, HDAC9(ANLS),
HDAC9a(ANLS), or HDRP(ANLS) polypeptides described herein can be used in
the manufacture of a medicament for the treatment of a cell proliferation disease, an
apoptotic disease, or a cell differentiation disease.

The invention will be further described by the following non-limiting
examples. The teachings of all publications cited herein are incorporated herein by
reference in their entirety.



EXEMPLIFICATION 
Cloning of cDNA encoding a novel HDAC, designated HDAC9

HDAC9 was cloned by PCR and 3' rapid amplification of cDNA ends using 
primers designed from the sequence of human chromosome 7 whose translated 
product exhibited 80% identity to the HDAC domain of HDAC4, described in detail 
as follows. 
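
By way of illustration only, a percent identity such as the 80% figure above can be computed from a pairwise alignment as the fraction of aligned, ungapped positions that carry the same residue. The following is a minimal sketch under that assumption; other conventions for the denominator exist, and the function name and the toy sequences are hypothetical and are not HDAC4 or HDAC9 sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over alignment columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # gap columns are excluded from the comparison
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Hypothetical toy alignment (not real HDAC sequence); "-" marks a gap.
toy_a = "MKTAYIAKQR-"
toy_b = "MKTSYIAKHRG"
print(percent_identity(toy_a, toy_b))
```

In this toy alignment, 8 of the 10 ungapped columns match, so the function returns 80.0.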

Database analyses indicate that HDRP is located on chromosome 7 (7p15-p21).
The human genome database (February 2001 release) of GenBank was
searched using the human HDAC4 amino acid sequence. The TBLASTN program
was used to identify open reading frames downstream of HDRP on chromosome 7
that exhibit significant homology to the HDAC domain of HDAC4. Several
fragments whose translated products exhibit over 58% identity were retrieved. Two
sense primers (OL486, 5'-CCATGGAAACGGTACCCAGCAGGC-3' (SEQ ID NO:
16) and OL487, 5'-CACTCCATCGCTATGATGAAGGG-3' (SEQ ID NO: 17)) and
two antisense primers (OL484, 5'-AGTTCCCTTCATCATAGCGATGG-3' (SEQ ID
NO: 18) and OL485, 5'-AATGTACAGGATGCTGGGGT-3' (SEQ ID NO: 19))
were each designed based upon one of these fragments whose translated products
matched amino acids 842-873 of HDAC4. RT-PCR was performed using each of
the antisense primers and a sense primer
(5'-CCCTTGTAGCTGGTGGAGTTCCCTT-3' (SEQ ID NO: 20)) from the coding
region of HDRP and human brain cDNA as a template. PCR was performed in a
Biometra TGRADIENT Thermocycler for 30 cycles at 95°C for 20 seconds, 60°C
for 20 seconds, and 72°C for 120 seconds.
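
As a non-limiting sketch of how a sense primer and an antisense primer relate, the reverse complement of a sense primer can be compared with an antisense primer designed from the same fragment. The primer sequences below are OL487 (SEQ ID NO: 17) and OL484 (SEQ ID NO: 18) as recited above; the helper function names are illustrative assumptions and are not part of the described method.

```python
# Map each base to its Watson-Crick partner (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def suffix_prefix_overlap(a: str, b: str) -> int:
    """Length of the longest suffix of `a` that is also a prefix of `b`."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a[-k:] == b[:k]:
            return k
    return 0


OL487_SENSE = "CACTCCATCGCTATGATGAAGGG"      # SEQ ID NO: 17
OL484_ANTISENSE = "AGTTCCCTTCATCATAGCGATGG"  # SEQ ID NO: 18

ol487_rc = reverse_complement(OL487_SENSE)
shared = suffix_prefix_overlap(OL484_ANTISENSE, ol487_rc)
print(ol487_rc)  # CCCTTCATCATAGCGATGGAGTG
print(shared)    # 19
```

Run on these two primers, the last 19 bases of OL484 coincide with the first 19 bases of the reverse complement of OL487, which is what one would expect of primers that anneal to opposite strands of the same region.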

3'-rapid amplification of cDNA ends was performed using the sense primer
OL486 and adaptor primer 1 (Clontech), and Marathon-Ready cDNA from human
brain (Clontech, Palo Alto, CA) according to the manufacturer's instructions. The
products were re-amplified using nested sense primer OL487 and adaptor primer 2
(Clontech, Palo Alto, CA). PCR products were cloned into pGEM-T-easy vector
(Promega, Madison, WI) and sequenced using an automated DNA sequencer at the
DNA Sequencing Core Facility of the Memorial Sloan-Kettering Cancer Center,
using DNA sequencing methods known to one of skill in the art.

Two cDNAs were cloned by the above-described methods. One cDNA
(SEQ ID NO: 1) encodes an HDAC9 protein that is 1011 amino acids in length. The
other cDNA (SEQ ID NO: 3) encodes an HDAC9a protein that is 879 amino acids
long. The cDNA sequence and amino acid sequence of HDAC9 and HDAC9a are shown
in FIGS. 1A-1G and FIGS. 2A-2B, respectively. Database analyses of these cDNAs
against human genomic DNA sequences indicated that these two cDNAs are
generated by alternative splicing. An alignment of HDAC9, HDAC9a, HDRP,
and HDAC4 is shown in FIGS. 3A-3C.

Each of the HDAC9 and HDAC9a nucleic acid sequences was cloned into
the pFLAG-CMV-5b vector (Sigma) in frame with the C-terminal FLAG tag. Only
the coding regions plus three extra base pairs (ACC) of cDNA of the HDAC9 and
HDAC9a nucleic acid sequences were included in the constructs. These constructs
are referred to herein as HDAC9-FLAG and HDAC9a-FLAG, respectively. These
constructs are contained in E. coli and can readily be expressed. For HDAC9, the
insert is 3033 bp and for HDAC9a, the insert size is 2637 bp. Both HDAC9 and
HDAC9a can be released with EcoRV and BamHI (whose sites have been
incorporated in the primers to obtain HDAC9 and HDAC9a coding cDNA for
cloning purposes) restriction enzyme digestion.
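
The stated insert sizes can be checked against the stated protein lengths by simple codon arithmetic. The sketch below assumes three nucleotides per amino acid and, by default, no stop codon in the count (an assumption that is consistent with an in-frame C-terminal tag, and not a statement of how the inserts were actually measured); the function name is hypothetical.

```python
CODON_LENGTH_BP = 3  # nucleotides per codon


def coding_length_bp(n_amino_acids: int, include_stop: bool = False) -> int:
    """Length in base pairs of a coding region for a protein of the given length."""
    if n_amino_acids < 0:
        raise ValueError("protein length must be non-negative")
    codons = n_amino_acids + (1 if include_stop else 0)
    return CODON_LENGTH_BP * codons


# Protein lengths as stated in the text: HDAC9 = 1011 aa, HDAC9a = 879 aa.
print(coding_length_bp(1011))  # 3033
print(coding_length_bp(879))   # 2637
```

Under these assumptions the arithmetic reproduces the 3033 bp and 2637 bp figures given for the HDAC9 and HDAC9a inserts.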

The HDAC9 cDNA sequences from the known 5'-end of HDRP cDNA to the 
3'-untranslated region cloned in this study cover over 511 kb of genomic DNA on
chromosome 7. As shown in FIG. 4, the coding region cDNA of HDAC9 resides in 
23 exons spanning 458 kb of genomic sequence. Exons 21, 22, and 23 are one 
single exon in HDAC9a, but the middle exon that is numbered exon 22 in FIG. 4, 
containing an in-frame stop codon, is spliced out in HDAC9. In addition, exons 12 
and 13 are a single exon used by HDRP. Exon 13 is spliced as part of an intron in 
HDAC9 and HDAC9a. 

Further analysis revealed that exon 7, which contains a nuclear localization
signal (NLS), is alternatively spliced in an HDRP isoform, creating HDRP(ANLS).
RT-PCR analyses using primers based on sequences from exon 6 and exon 14
indicate that this alternative splicing event also occurs in HDAC9 and/or HDAC9a.
Thus, it is possible that at least 6 proteins can be generated from a single HDAC9
gene by alternative splicing of its RNA. The cDNA sequences and amino acid
sequences for HDAC9, HDAC9a, HDAC9(ANLS), HDAC9a(ANLS), and 
HDRP(ANLS) are shown in FIGS. 1A-10 and 2A-2E, respectively. 

HDAC9 mRNA is differentially expressed among human tissues 

The expression of HDAC9 mRNA was determined by Northern blot analysis 
using a human multiple tissue Northern blot (Clontech, Palo Alto, CA). 
Hybridization was performed according to the manufacturer's instructions using
ExpressHyb solution (Clontech, Palo Alto, CA). The 32P-random priming labeled
S'-untranslated region common to both HDAC9 and HDAC9a that shares no 
significant sequence homology with HDRP was used as a probe. Two transcripts at
9.8 and 4.1 kb were detected in all tissues examined (FIG. 6A). The 4.1 kb
transcript is shorter than the 4.4 kb HDRP transcript (See Zhou et al., Proc. Natl.
Acad. Sci. USA, 97:1056-1061 (2000)). A third transcript at 1.2 kb was detected in
placenta (FIG. 6A). Similar to HDRP (See Zhou, X., et al., Proc. Natl. Acad. Sci.
USA, 97:1056-1061 (2000)), high levels of HDAC9 transcripts were detected in
brain and skeletal muscle (FIG. 6A).

The distribution of alternatively spliced mRNA variants among tissues was
examined by RT-PCR using primers (OL516 5'-TGTGTCATCGAGCTGGCTTC-3'
(SEQ ID NO: 21) and OL517 5'-ATCTTCTGCAAGTGGCTCCA-3' (SEQ ID NO:
22)) spanning the alternatively spliced exon 22 and a cDNA panel from the same
tissues as the multiple tissue Northern blot. PCR was performed in a Biometra
TGRADIENT Thermocycler for 30 cycles at 95°C for 20 seconds, 60°C for 20
seconds, and 72°C for 60 seconds. The expected sizes of PCR products were 680
base pairs for HDAC9 and 993 base pairs for HDAC9a. The ratio of HDAC9 and
HDAC9a transcripts differed among tissues (FIG. 6B). In the placenta and kidney,
the levels of the two transcripts were about the same (FIG. 6B). In the brain, heart,
and pancreas, there were more transcripts of HDAC9 than HDAC9a. In the other
tissues examined, there were more HDAC9a transcripts than HDAC9 transcripts
(FIG. 6B). Under the conditions tested, HDAC9 transcripts were undetectable in
liver (FIG. 6B). The lung had an HDAC9 product that was larger than expected and
abundant. The lung also had low levels of HDAC9 transcripts and HDAC9a
transcripts (FIG. 6B). An additional PCR product was also amplified from cDNA of
the pancreas; this product differed in size from the expected products from HDAC9
and HDAC9a (FIG. 6B). The identity of the different sized transcripts is unknown.
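
For illustration, observed RT-PCR bands can be compared with the two expected product sizes. The sketch below uses the 680 bp and 993 bp values stated above; the tolerance, the function name, and the reading that the 313 bp difference reflects the alternatively spliced segment are assumptions made for the example only.

```python
# Expected RT-PCR product sizes (bp) for primers OL516/OL517, as stated in the text.
EXPECTED_PRODUCT_BP = {"HDAC9": 680, "HDAC9a": 993}

# Difference between the two expected products (993 - 680 = 313 bp).
SIZE_DIFFERENCE_BP = EXPECTED_PRODUCT_BP["HDAC9a"] - EXPECTED_PRODUCT_BP["HDAC9"]


def classify_band(size_bp: float, tolerance_bp: float = 25.0) -> str:
    """Assign a band to the isoform with the nearest expected size, if within tolerance.

    The tolerance is a hypothetical value standing in for gel sizing error.
    """
    name, expected = min(
        EXPECTED_PRODUCT_BP.items(), key=lambda item: abs(item[1] - size_bp)
    )
    if abs(expected - size_bp) <= tolerance_bp:
        return name
    return "unassigned"


print(SIZE_DIFFERENCE_BP)   # 313
print(classify_band(690))   # HDAC9
print(classify_band(1500))  # unassigned
```

A band that falls outside the tolerance of both expected sizes, such as the additional lung and pancreas products noted above, is reported as unassigned rather than forced onto either isoform.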


HDAC9 and HDAC9a possess histone deacetylase activity 

HDAC9 was named based on sequence homology to HDAC4 (FIGS. 3A-3C).
To determine whether HDAC9 and HDAC9a possess HDAC activity, an
HDAC enzymatic assay was performed using anti-FLAG immunoprecipitated
HDAC9-FLAG and HDAC9a-FLAG.

C-terminal FLAG-tagged HDAC9 (HDAC9-FLAG) and HDAC9a
(HDAC9a-FLAG) expression vectors were constructed using the pFLAG-CMV-5b
vector (Sigma) and PCR amplified coding regions of HDAC9 and HDAC9a in
frame with the FLAG-tag to form pFLAG-CMV-5b-HDAC9 (plasmid VR1) and
pFLAG-CMV-5b-HDAC9a (plasmid VR2). All constructs were confirmed by DNA
sequencing.

Transfection of human kidney 293T cells, immunoprecipitation using anti-FLAG
M2 Agarose (Sigma), Western blot analyses and dual luciferase assays were
performed essentially as previously described by Zhou et al. (Proc. Natl. Acad. Sci.
USA 97:1056-1061 (2000)). Briefly, the cells (American Type Culture Collection)
were cultured in DME HG medium (GIBCO/BRL) supplemented with 10%
(vol/vol) FBS at 37°C in a 5% CO2 atmosphere. Transient transfection was
performed by using Lipofectamine (GIBCO/BRL) or Fugene 6 (Roche Molecular
Biochemicals) according to the manufacturers' instructions. Cells were harvested
24 to 48 hours after transfection and lysed in IP lysis buffer (50 mM Tris HCl, pH
7.5/120 mM NaCl/5 mM EDTA/0.5% NP-40) at 5 x 10^7 cells per ml.
Immunoprecipitation with anti-FLAG M2-agarose (Sigma, St. Louis, MO) was
performed according to the manufacturer's instructions. Immunoprecipitated
proteins were released from the agarose beads by using FLAG-peptide and either
used directly for HDAC enzymatic activity assays or resolved on SDS/PAGE for
Western blot analyses. Anti-FLAG antibody was purchased from Sigma (St. Louis,
MO). Western blot analyses were performed using standard methods.

HDAC9 and HDAC9a enzymatic activity was assessed with the HDAC
Fluorescent Activity Assay/Drug Discovery Kit-AK-500 (BIOMOL Research
Laboratories), using a FLUOR DE LYS™ substrate that contains an acetylated
lysine side chain and immunoprecipitated HDAC9-FLAG and HDAC9a-FLAG
polypeptides according to the manufacturer's instructions, and a SPECTRAmax®
GEMINI XS microplate spectrofluorometer using the SOFTmax® PRO system
(Molecular Devices) at excitation 355 nm and emission 460 nm with a cut off filter
of 455 nm. Briefly, HDAC9-FLAG and HDAC9a-FLAG were incubated with the
substrate overnight at room temperature in a 96-well plate. The reaction was
stopped by addition of Fluor De Lys™ Developer and samples were read with the
fluorometer.

As shown in FIG. 7, both HDAC9-FLAG and HDAC9a-FLAG deacetylated
the acetylated lysine of FLUOR DE LYS™ and the activity of HDAC9 and
HDAC9a was comparable. To examine the activity of HDAC9 and HDAC9a,
inhibition studies using TSA were carried out by preincubating HDAC9-FLAG and
HDAC9a-FLAG with TSA for 15 minutes at room temperature. The assay was then
carried out as stated above. As shown in FIG. 7, TSA inhibited HDAC9 and
HDAC9a deacetylase activity. The inset gel in FIG. 7 shows the amount of protein
used in the assay. SAHA, a potent HDAC inhibitor (Richon et al., Proc. Natl. Acad.
Sci. USA, 95:3003-3007 (1998)), also completely inhibited the histone deacetylase
activity of HDAC9-FLAG and HDAC9a-FLAG. The HDAC activity of HDAC9
and HDAC9a was about ten times lower than the deacetylase activity of HDAC4
when a comparable amount of protein was used under the conditions tested here.

HDAC9 and HDAC9a enzymatic activity was also determined through
HDAC enzymatic assays using 3H-histones isolated from murine erythroleukemia
cells as a substrate. This assay was performed essentially as described by Richon et
al. (Proc. Natl. Acad. Sci. USA, 95:3003-3007 (1998)). Briefly, HDAC9-FLAG
and HDAC9a-FLAG were incubated with 3H-histones overnight at 37°C. The
reaction was stopped by the addition of 1 M HCl/0.1 acetic acid. Released 3H-acetic
acid was extracted with ethyl acetate and quantified by scintillation counting. For
inhibition studies, the immunoprecipitated complexes were preincubated with the
different HDAC inhibitors for 30 minutes at 4°C.

As shown in FIG. 8, HDAC9a-FLAG deacetylated 3H-acetyl-histones.
SAHA, a potent HDAC inhibitor, also completely inhibited the histone deacetylase
activity of HDAC9a-FLAG. TSA also inhibited HDAC9a deacetylase activity.
Similar results were obtained when HDAC9 was used as the enzyme source.

HDAC9 and HDAC9a repress MEF2-mediated transcription 

The Xenopus homolog of HDRP, MITR, was identified as a MEF2
interacting transcriptional repressor (Sparrow et al., EMBO J. 18:5085-5098 (1999)),
and mouse HDRP also interacts with MEF2 and represses MEF2-mediated transcription
(Zhang et al., J. Biol. Chem. 276:35-39 (2001)). We first tested whether HDAC9-FLAG
and HDAC9a-FLAG interact with MEF2. 293 cells were transfected with
vector, HDAC9-FLAG, or HDAC9a-FLAG. The cells were subsequently lysed and
HDAC9-FLAG and HDAC9a-FLAG proteins were immunoprecipitated with anti-FLAG
antibodies. Western blot analysis of the immunoprecipitated proteins was
carried out, using anti-MEF-2 antibody to probe the blot. As shown in FIG. 9A,
both HDAC9 and HDAC9a interacted with MEF2 in 293T cells.

It was then determined whether HDAC9 and HDAC9a repress MEF2-mediated
transcription. This determination was carried out as follows. The
p3XMEF2-luciferase reporter gene (100 ng) and the vector pRL-TK (Promega) (5
ng) were co-transfected into 293T cells in the absence (pcDNA3 empty vector) or
presence of MEF2C (100 ng of pCMV-MEF2C). HDAC9-F (1 ng, 10 ng, or 100 ng
of pFLAG-HDAC9; pFLAG-HDAC9 and HDAC9-FLAG are different constructs,
with the FLAG sequence located at opposite ends of the HDAC9 nucleotide
sequence, but are functionally equivalent) or HDAC9a-F (1 ng, 10 ng, or 100 ng of
pFLAG-HDAC9a; pFLAG-HDAC9a and HDAC9a-FLAG are different constructs,
with the FLAG sequence located at opposite ends of the HDAC9a nucleotide
sequence, but are functionally equivalent) was included in a subset of experimental
groups with the MEF2C vector. pFLAG empty vector was used to adjust the DNA
to an equal amount in each transfection. The cells were harvested 24 to 36 hours
after transfection and the luciferase activities were measured using the
Dual-Luciferase™ Reporter Assay System from Promega according to the
manufacturer's instructions. The firefly luciferase activity was first normalized to
the co-transfected Renilla luciferase activity (encoded by the pRL-TK vector), and
the luciferase activity value for cells transfected with MEF2C alone was set at 1.
MEF2C activated transcription over 30 times the basal level of transcription. As
shown in FIG. 9B, HDAC9-FLAG and HDAC9a-FLAG repressed MEF2C-mediated
transcriptional activation in a dose-dependent manner and completely abolished the
activation at the 100 ng dose for both HDAC9 and HDAC9a. The transcriptional
repression effect of HDAC9 and HDAC9a on MEF2C-mediated transcription was a
specific effect since a co-transfected reporter gene for transfection efficiency
containing a TK promoter was not repressed by HDAC9 or HDAC9a.
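
The two-step normalization described above (firefly signal divided by the co-transfected Renilla signal, then expressed relative to cells transfected with MEF2C alone) can be sketched as follows. This is a minimal illustration only; the function name and all luminometer readings are hypothetical and are not data from FIG. 9B.

```python
def relative_luciferase(
    firefly: float, renilla: float, ref_firefly: float, ref_renilla: float
) -> float:
    """Firefly/Renilla ratio of a sample, relative to a reference set at 1.

    The reference here stands for cells transfected with MEF2C alone.
    """
    if renilla <= 0 or ref_renilla <= 0:
        raise ValueError("Renilla readings must be positive")
    if ref_firefly <= 0:
        raise ValueError("reference firefly reading must be positive")
    sample_ratio = firefly / renilla
    reference_ratio = ref_firefly / ref_renilla
    return sample_ratio / reference_ratio


# Hypothetical readings in arbitrary light units (not measured data).
mef2c_alone = {"firefly": 30000.0, "renilla": 1000.0}
with_repressor = {"firefly": 3000.0, "renilla": 1000.0}

value = relative_luciferase(
    with_repressor["firefly"],
    with_repressor["renilla"],
    mef2c_alone["firefly"],
    mef2c_alone["renilla"],
)
print(value)
```

With these made-up readings the sample scores about 0.1, i.e., one tenth of the MEF2C-alone signal; dividing by the Renilla reading first is what keeps differences in transfection efficiency from being mistaken for repression.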

Described herein is the identification and characterization of a new class II
HDAC, designated HDAC9. HDAC9 has several alternatively spliced isoforms,
one of which is the previously identified HDRP (Zhou et al., Proc. Natl. Acad. Sci.
USA 97:1056-1061 (2000)). HDAC9 and HDAC9a possess HDAC activity, although
their specific enzymatic activity appears to be lower than that of HDAC4. While not
wishing to be bound by any particular theory, it is possible that an essential co-factor
is lost during immunoprecipitation or does not exist in 293T cells (for example,
metastasis-associated protein 2 is essential for the assembly of a catalytically active
HDAC1 (Zhang et al., Genes Dev. 13:1924-1935 (1999))), that the substrates used are
not its natural substrates, or that the FLAG tag interferes with the folding of the
protein.

Searching the human genome with the HDAC domain from either HDAC1
or HDAC9 identified a total of 10 HDACs in the presently completed human
genome sequence, a number of which are schematically represented in FIG. 10.
HDACs 1, 2, 3, 8, 4, 5, 6, 7, 9, and 9a all have HDAC domains. HDRP, which is
also schematically depicted in FIG. 10, does not have a catalytic domain.

All references described herein are incorporated by reference in their
entirety. While this invention has been particularly shown and described with
reference to preferred embodiments thereof, it will be understood by those skilled in
the art that various changes in form and details may be made therein without
departing from the spirit and scope of the invention as defined by the appended
claims.

CLAIMS

What is claimed is: 

1. An isolated or recombinant histone deacetylase polypeptide, said polypeptide 
selected from: 

a) an isolated or recombinant polypeptide comprising SEQ ID NO: 2, 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10; 
and 

b) an isolated or recombinant polypeptide having at least 60% sequence 
identity with any one of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 
6, SEQ ID NO: 8, or SEQ ID NO: 10. 

2. The isolated or recombinant histone deacetylase polypeptide of Claim 1, said
polypeptide selected from:

a) a polypeptide consisting of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID
NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10.

3. The isolated or recombinant histone deacetylase polypeptide of Claim 1,
wherein said polypeptide is human.

4. An isolated nucleic acid molecule selected from the group:

a) an isolated nucleic acid comprising SEQ ID NO: 1, SEQ ID NO: 3,
SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9;

b) a complement of an isolated nucleic acid comprising SEQ ID NO: 1,
SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, or SEQ ID NO: 9;

c) an isolated nucleic acid encoding a histone deacetylase polypeptide
of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 8, or
SEQ ID NO: 10;

d) a complement of an isolated nucleic acid encoding a histone
deacetylase polypeptide of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID
NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10;

e) a nucleic acid that is hybridizeable under high stringency conditions
to a nucleic acid molecule that encodes any of SEQ ID NO: 2, SEQ
ID NO: 4, SEQ ID NO: 6, or SEQ ID NO: 8, or a complement
thereof; or

f) a nucleic acid molecule that is hybridizeable under high stringency
conditions to a nucleic acid comprising SEQ ID NO: 1, SEQ ID NO:
3, SEQ ID NO: 5, or SEQ ID NO: 7; and

g) an isolated nucleic acid molecule that has at least 55% sequence
identity with any one of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO:
5, SEQ ID NO: 7, SEQ ID NO: 9, or a complement thereof.

5. The isolated nucleic acid molecule of Claim 4, said nucleic acid molecule
consisting of the nucleic acid molecule selected from the group consisting of
SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, SEQ ID NO: 7, and SEQ ID
NO: 9.

6. The isolated nucleic acid molecule of Claim 4, wherein said nucleic acid
molecule is human.

7. A vector comprising the isolated nucleic acid molecule of Claim 4.

8. A cell comprising the vector of Claim 7.

9. A cell comprising the isolated nucleic acid molecule of Claim 4.

10. A purified antibody that selectively binds a polypeptide of Claim 1.

11. A method of identifying a compound that modulates expression of a nucleic
acid molecule of Claim 4, said method comprising the steps of:

a) contacting said nucleic acid molecule with a candidate compound
under conditions suitable for expression; and

b) assessing the level of expression of said nucleic acid molecule,
wherein a candidate compound that increases or decreases expression of said
nucleic acid molecule relative to a control is a compound that modulates
expression of said nucleic acid molecule.

12. The method of Claim 11, wherein said method is carried out in a cell or
animal.

13. The method of Claim 11, wherein said method is carried out in a cell free
system.

14. A method of identifying a compound that modulates the enzymatic activity
of the polypeptide of Claim 1, said method comprising the steps of:

a) contacting said polypeptide with a candidate compound under
conditions suitable for enzymatic reaction; and

b) assessing the enzymatic activity level of said polypeptide,
wherein a candidate compound that increases or decreases the enzymatic
activity level of said polypeptide relative to a control is a compound that
modulates the enzymatic activity of said polypeptide.

15. The method of Claim 14, wherein said method is carried out in a cell or
animal.

16. The method of Claim 14, wherein said method is carried out in a cell free
system.

17. The method of Claim 14, wherein said polypeptide is further contacted with
a substrate for the polypeptide, and wherein said substrate is selected from
the group consisting of a cell proliferation disease binding agent, an
apoptotic disease binding agent, and a cell differentiation disease binding
agent.

18. The method of Claim 17, wherein said candidate compound is an inhibitor.

19. The method of Claim 17, wherein said candidate compound is an activator.

20. A method of identifying a compound that modulates the transcriptional
repression activity of the polypeptide of Claim 1, said method comprising
the steps of:

a) contacting said polypeptide with a candidate compound under
conditions suitable for a transcriptional repression reaction; and

b) assessing the transcriptional repression activity level of said
polypeptide,
wherein a candidate compound that increases or decreases the transcriptional
repression activity level of said polypeptide relative to a control is a
compound that modulates the transcriptional repression activity of said
polypeptide.

21. The method of Claim 20, wherein said method is carried out in a cell or
animal.

22. The method of Claim 20, wherein said method is carried out in a cell free
system.

23. The method of Claim 20, wherein said polypeptide is further contacted with
a substrate for the polypeptide, and wherein said substrate is selected from
the group consisting of a cell proliferation disease binding agent, an
apoptotic disease binding agent, and a cell differentiation disease binding
agent.

24. The method of Claim 23, wherein said candidate compound is an inhibitor.

25. The method of Claim 23, wherein said candidate compound is an activator.

26. A method of identifying a compound that modulates expression of a nucleic
acid molecule of Claim 4, said method comprising the steps of:

a) providing a nucleic acid molecule comprising a promoter region of
said nucleic acid of Claim 4 or part of a promoter region of said
nucleic acid of Claim 4 operably linked to a reporter gene;

b) contacting said nucleic acid molecule or with a candidate compound;
and

c) assessing the level of said reporter gene,
wherein a candidate compound that increases or decreases expression of said
reporter gene relative to a control is a compound that modulates expression
of said nucleic acid molecule of Claim 4.

27. The method of Claim 26, wherein said method is carried out in a cell.

28. A method of identifying a polypeptide that interacts with a polypeptide of
Claim 1 in a yeast two-hybrid system, said method comprising the steps of:

a) providing a first nucleic acid vector comprising a nucleic acid
molecule encoding a DNA binding domain and said polypeptide of
Claim 1;

b) providing a second nucleic acid vector comprising a nucleic acid
encoding a transcription activation domain and a nucleic acid
encoding a test polypeptide;

c) contacting said first nucleic acid vector with said second nucleic acid
vector in a yeast two-hybrid system; and

d) assessing transcriptional activation in said yeast two-hybrid system,
wherein an increase in transcriptional activation relative to a control
indicates that the test polypeptide is a polypeptide that interacts with said
polypeptide of Claim 1.

29. A pharmaceutical composition comprising a polypeptide of Claim 1.

30. A method of diagnosing a cell proliferation disease, an apoptotic disease, or
a cell differentiation disease in a subject, said method comprising the steps
of:

a) obtaining a sample from said subject; and

b) assessing the level of activity or expression of said polypeptide of
Claim 1 in said sample, or detecting the level of said nucleic acid
molecule of Claim 4,
wherein if said level is increased relative to a control, then said subject has
an increased likelihood of having a cell proliferation disease, an apoptotic
disease, or a cell differentiation disease, and wherein if said level is
decreased relative to a control, then said subject has a decreased likelihood
of having a cell proliferation disease, an apoptotic disease, or a cell
differentiation disease.

31. The method of Claim 30, wherein said level of activity or expression of said
polypeptide of Claim 1 in said sample is measured using
immunohistochemical techniques.

32. The method of Claim 30, wherein said level of said nucleic acid molecule of
Claim 4 in said sample is measured using in situ hybridization techniques.

33. A method of treating a cell proliferation disease, an apoptotic disease, or a
cell differentiation disease, said method comprising administering a
compound identified by the method of Claim 14.

34. A method of treating a cell proliferation disease, an apoptotic disease, or a
cell differentiation disease, said method comprising administering a
compound identified by the method of Claim 20.

SEQUENCE LISTING

<110> Sloan-Kettering Institute for Cancer Research 
Richon, Victoria 
Zhou, Xianbo 
Rifkind, Richard A. 
Marks, Paul A. 

<120> HDAC9 Polypeptides and Polynucleotides 
and Uses Thereof 

<130> 3254.1000005 

<150> 60/298,173 
<151> 2001-06-14 

<150> 60/311,686 
<151> 2001-08-10 

<150> 60/316,995 
<151> 2001-09-04 

<160> 22 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 3186 
<212> DNA 

<213> Homo sapiens 
<400> 1 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgcctct gagcccaact tgaaggtgcg gtccaggtta 840 
aaacagaaag tggcagagag gagaagcagc cccttactca ggcggaagga tggaaatgtt 900 
gtcacttcat tcaagaagcg aatgtttgag gtgacagaat cctcagtcag tagcagttct 960 
ccaggctctg gtcccagttc accaaacaat gggccaactg gaagtgttac tgaaaatgag 1020 
acttcggttt tgccccctac ccctcatgcc gagcaaatgg tttcacagca acgcattcta 1080 
attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140 
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200 
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260 
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320 
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380 
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440 
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500 
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560 
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620 
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680 
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740
agcagtgctt gtgtggatga cacactggga caagttgggg ctgtgaaggt caaggaggaa 1800
ccagtggaca gtgatgaaga tgctcagatc caggaaatgg aatctgggga gcaggctgct 1860
tttatgcaac agcctttcct ggaacccacg cacacacgtg cgctctctgt gcgccaagct 1920
ccgctggctg cggttggcat ggatggatta gagaaacacc gtctcgtctc caggactcac 1980
tcttcccctg ctgcctctgt tttacctcac ccagcaatgg accgccccct ccagcctggc 2040
tctgcaactg gaattgccta tgaccccttg atgctgaaac accagtgcgt ttgtggcaat 2100
tccaccaccc accctgagca tgctggacga atacagagta tctggtcacg actgcaagaa 2160
actgggctgc taaataaatg tgagcgaatt caaggtcgaa aagccagcct ggaggaaata 2220
cagcttgttc attctgaaca tcactcactg ttgtatggca ccaaccccct ggacggacag 2280
aagctggacc ccaggatact cctaggtgat gactctcaaa agtttttttc ctcattacct 2340
tgtggtggac ttggggtgga cagtgacacc atttggaatg agctacactc gtccggtgct 2400
gcacgcatgg ctgttggctg tgtcatcgag ctggcttcca aagtggcctc aggagagctg 2460
aagaatgggt ttgctgttgt gaggccccct ggccatcacg ctgaagaatc cacagccatg 2520
gggttctgct tttttaattc agttgcaatt accgccaaat acttgagaga ccaactaaat 2580
ataagcaaga tattgattgt agatctggat gttcaccatg gaaacggtac ccagcaggcc 2640
ttttatgctg accccagcat cctgtacatt tcactccatc gctatgatga agggaacttt 2700
ttccctggca gtggagcccc aaatgaggtt ggaacaggcc ttggagaagg gtacaatata 2760
aatattgcct ggacaggtgg ccttgatcct cccatgggag atgttgagta ccttgaagca 2820
ttcaggacca tcgtgaagcc tgtggccaaa gagtttgatc cagacatggt cttagtatct 2880
gctggatttg atgcattgga aggccacacc cctcctctag gagggtacaa agtgacggca 2940
aaatgttttg gtcatttgac gaagcaattg atgacattgg ctgatggacg tgtggtgttg 3000
gctctagaag gaggacatga tctcacagcc atctgtgatg catcagaagc ctgtgtaaat 3060
gcccttctag gaaatgagct ggagccactt gcagaagata ttctccacca aagcccgaat 3120
atgaatgctg ttatttcttt acagaagatc attgaaattc aaagtatgtc tttaaagttc 3180
tcttaa 3186



<210> 2 

<211> 1011 

<212> PRT 

<213> Homo sapiens 



<400> 2 
Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val
1 5 10 15

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met
20 25 30

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln
35 40 45

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu
50 55 60

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln
65 70 75 80

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln
85 90 95

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu
100 105 110

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg
115 120 125

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys
130 135 140

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr
145 150 155 160

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp
165 170 175

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu
180 185 190

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp
195 200 205

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Ala Ser Glu Pro Asn Leu
210 215 220

Lys Val Arg Ser Arg Leu Lys Gln Lys Val Ala Glu Arg Arg Ser Ser
225 230 235 240

Pro Leu Leu Arg Arg Lys Asp Gly Asn Val Val Thr Ser Phe Lys Lys
245 250 255

Arg Met Phe Glu Val Thr Glu Ser Ser Val Ser Ser Ser Ser Pro Gly
260 265 270

Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly Ser Val Thr Glu
275 280 285

Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala Glu Gln Met Val
290 295 300

Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met Asn Leu Leu Ser
305 310 315 320

Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu Gly Leu Pro Ala
325 330 335

Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys Glu Lys Gln Lys
340 345 350

Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu Pro Gly Gln Tyr
355 360 365

Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His Val Thr Leu Glu
370 375 380

Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu Gln His Leu Leu
385 390 395 400

Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val Ala Gly Gly Val
405 410 415

Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu Arg Ile Ser Pro
420 425 430

Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg Pro Leu Asn Arg
435 440 445

Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala Gln Leu Val Ile
450 455 460

Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys Gln Tyr Gln Gln
465 470 475 480

Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile Glu Gln Leu Lys
485 490 495

Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu Leu Gln Gly Asp
500 505 510

Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly Asn Ser Thr Arg
515 520 525

Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly Gln Val Gly Ala
530 535 540

Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu Asp Ala Gln Ile
545 550 555 560

Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met Gln Gln Pro Phe
565 570 575

Leu Glu Pro Thr His Thr Arg Ala Leu Ser Val Arg Gln Ala Pro Leu
580 585 590

Ala Ala Val Gly Met Asp Gly Leu Glu Lys His Arg Leu Val Ser Arg
595 600 605

Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His Pro Ala Met Asp
610 615 620

Arg Pro Leu Gln Pro Gly Ser Ala Thr Gly Ile Ala Tyr Asp Pro Leu
625 630 635 640

Met Leu Lys His Gln Cys Val Cys Gly Asn Ser Thr Thr His Pro Glu
645 650 655

His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg Leu Gln Glu Thr Gly
660 665 670

Leu Leu Asn Lys Cys Glu Arg Ile Gln Gly Arg Lys Ala Ser Leu Glu
675 680 685

Glu Ile Gln Leu Val His Ser Glu His His Ser Leu Leu Tyr Gly Thr
690 695 700

Asn Pro Leu Asp Gly Gln Lys Leu Asp Pro Arg Ile Leu Leu Gly Asp
705 710 715 720

Asp Ser Gln Lys Phe Phe Ser Ser Leu Pro Cys Gly Gly Leu Gly Val
725 730 735

Asp Ser Asp Thr Ile Trp Asn Glu Leu His Ser Ser Gly Ala Ala Arg
740 745 750

Met Ala Val Gly Cys Val Ile Glu Leu Ala Ser Lys Val Ala Ser Gly
755 760 765

Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro Gly His His Ala
770 775 780

Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn Ser Val Ala Ile
785 790 795 800

Thr Ala Lys Tyr Leu Arg Asp Gln Leu Asn Ile Ser Lys Ile Leu Ile
805 810 815

Val Asp Leu Asp Val His His Gly Asn Gly Thr Gln Gln Ala Phe Tyr
820 825 830

Ala Asp Pro Ser Ile Leu Tyr Ile Ser Leu His Arg Tyr Asp Glu Gly
835 840 845

Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val Gly Thr Gly Leu
850 855 860

Gly Glu Gly Tyr Asn Ile Asn Ile Ala Trp Thr Gly Gly Leu Asp Pro
865 870 875 880

Pro Met Gly Asp Val Glu Tyr Leu Glu Ala Phe Arg Thr Ile Val Lys
885 890 895

Pro Val Ala Lys Glu Phe Asp Pro Asp Met Val Leu Val Ser Ala Gly
900 905 910

Phe Asp Ala Leu Glu Gly His Thr Pro Pro Leu Gly Gly Tyr Lys Val
915 920 925

Thr Ala Lys Cys Phe Gly His Leu Thr Lys Gln Leu Met Thr Leu Ala
930 935 940

Asp Gly Arg Val Val Leu Ala Leu Glu Gly Gly His Asp Leu Thr Ala
945 950 955 960

Ile Cys Asp Ala Ser Glu Ala Cys Val Asn Ala Leu Leu Gly Asn Glu
965 970 975

Leu Glu Pro Leu Ala Glu Asp Ile Leu His Gln Ser Pro Asn Met Asn
980 985 990

Ala Val Ile Ser Leu Gln Lys Ile Ile Glu Ile Gln Ser Met Ser Leu
995 1000 1005

Lys Phe Ser
1010



<210> 3 

<211> 3499 

<212> DNA 

<213> Homo sapiens 

<400> 3 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780
gatgatttcc cccttcgaaa aactgcctct gagcccaact tgaaggtgcg gtccaggtta 840
aaacagaaag tggcagagag gagaagcagc cccttactca ggcggaagga tggaaatgtt 900
gtcacttcat tcaagaagcg aatgtttgag gtgacagaat cctcagtcag tagcagttct 960
ccaggctctg gtcccagttc accaaacaat gggccaactg gaagtgttac tgaaaatgag 1020
acttcggttt tgccccctac ccctcatgcc gagcaaatgg tttcacagca acgcattcta 1080
attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200 
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260 
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320 
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380 
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440 
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500 
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560 
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620 
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680 
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740 
agcagtgctt gtgtggatga cacactggga caagttgggg ctgtgaaggt caaggaggaa 1800 
ccagtggaca gtgatgaaga tgctcagatc caggaaatgg aatctgggga gcaggctgct 1860 
tttatgcaac agcctttcct ggaacccacg cacacacgtg cgctctctgt gcgccaagct 1920 
ccgctggctg cggttggcat ggatggatta gagaaacacc gtctcgtctc caggactcac 1980 
tcttcccctg ctgcctctgt tttacctcac ccagcaatgg accgccccct ccagcctggc 2040 
tctgcaactg gaattgccta tgaccccttg atgctgaaac accagtgcgt ttgtggcaat 2100 
tccaccaccc accctgagca tgctggacga atacagagta tctggtcacg actgcaagaa 2160 
actgggctgc taaataaatg tgagcgaatt caaggtcgaa aagccagcct ggaggaaata 2220 
cagcttgttc attctgaaca tcactcactg ttgtatggca ccaaccccct ggacggacag 2280 
aagctggacc ccaggatact cctaggtgat gactctcaaa agtttttttc ctcattacct 2340 
tgtggtggac ttggggtgga cagtgacacc atttggaatg agctacactc gtccggtgct 2400 
gcacgcatgg ctgttggctg tgtcatcgag ctggcttcca aagtggcctc aggagagctg 2460 
aagaatgggt ttgctgttgt gaggccccct ggccatcacg ctgaagaatc cacagccatg 2520 
gggttctgct tttttaattc agttgcaatt accgccaaat acttgagaga ccaactaaat 2580 
ataagcaaga tattgattgt agatctggat gttcaccatg gaaacggtac ccagcaggcc 2640 
ttttatgctg accccagcat cctgtacatt tcactccatc gctatgatga agggaacttt 2700 
ttccctggca gtggagcccc aaatgaggtt cggtttattt ctttagagcc ccacttttat 2760 
ttgtatcttt caggtaattg cattgcatga ttacccctaa ttttcttgtc ctttgctggt 2820 
gttttaaatt acacgagatt actgaattgt cccatgggac caagaaccag tgcagaacaa 2880 
gtgcataacc cagagcactg tttgtcaggg aaggttgggc tgatttgatg tgttgtttga 2940 
tgtttatttc aagagctccc atgtgcttgt tttcctctct tcttgctttc ttccatttgc 3000 
tctcttctct gcccaccgtg gtgtgtcttt ctcttcccag gttggaacag gccttggaga 3060 
agggtacaat ataaatattg cctggacagg tggccttgat cctcccatgg gagatgttga 3120 
gtaccttgaa gcattcagga ccatcgtgaa gcctgtggcc aaagagtttg atccagacat 3180 
ggtcttagta tctgctggat ttgatgcatt ggaaggccac acccctcctc taggagggta 3240 
caaagtgacg gcaaaatgtt ttggtcattt gacgaagcaa ttgatgacat tggctgatgg 3300 
acgtgtggtg ttggctctag aaggaggaca tgatctcaca gccatctgtg atgcatcaga 3360 
agcctgtgta aatgcccttc taggaaatga gctggagcca cttgcagaag atattctcca 3420 
ccaaagcccg aatatgaatg ctgttatttc tttacagaag atcattgaaa ttcaaagtat 3480 
gtctttaaag ttctcttaa 3499 

<210> 4 
<211> 879 
<212> PRT 

<213> Homo sapiens 



<400> 4 



Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val
1 5 10 15

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met
20 25 30

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln
35 40 45

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu
50 55 60

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln
65 70 75 80

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln
85 90 95

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu
100 105 110

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg
115 120 125

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys
130 135 140

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr
145 150 155 160

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp
165 170 175

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu
180 185 190

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp
195 200 205

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Ala Ser Glu Pro Asn Leu
210 215 220

Lys Val Arg Ser Arg Leu Lys Gln Lys Val Ala Glu Arg Arg Ser Ser
225 230 235 240

Pro Leu Leu Arg Arg Lys Asp Gly Asn Val Val Thr Ser Phe Lys Lys
245 250 255

Arg Met Phe Glu Val Thr Glu Ser Ser Val Ser Ser Ser Ser Pro Gly
260 265 270

Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly Ser Val Thr Glu
275 280 285

Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala Glu Gln Met Val
290 295 300

Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met Asn Leu Leu Ser
305 310 315 320

Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu Gly Leu Pro Ala
325 330 335

Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys Glu Lys Gln Lys
340 345 350

Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu Pro Gly Gln Tyr
355 360 365

Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His Val Thr Leu Glu
370 375 380

Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu Gln His Leu Leu
385 390 395 400

Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val Ala Gly Gly Val
405 410 415

Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu Arg Ile Ser Pro
420 425 430

Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg Pro Leu Asn Arg
435 440 445

Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala Gln Leu Val Ile
450 455 460

Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys Gln Tyr Gln Gln
465 470 475 480

Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile Glu Gln Leu Lys
485 490 495

Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu Leu Gln Gly Asp
500 505 510

Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly Asn Ser Thr Arg
515 520 525

Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly Gln Val Gly Ala
530 535 540

Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu Asp Ala Gln Ile
545 550 555 560

Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met Gln Gln Pro Phe
565 570 575

Leu Glu Pro Thr His Thr Arg Ala Leu Ser Val Arg Gln Ala Pro Leu
580 585 590

Ala Ala Val Gly Met Asp Gly Leu Glu Lys His Arg Leu Val Ser Arg
595 600 605

Thr His Ser Ser Pro Ala Ala Ser Val Leu Pro His Pro Ala Met Asp
610 615 620

Arg Pro Leu Gln Pro Gly Ser Ala Thr Gly Ile Ala Tyr Asp Pro Leu
625 630 635 640

Met Leu Lys His Gln Cys Val Cys Gly Asn Ser Thr Thr His Pro Glu
645 650 655

His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg Leu Gln Glu Thr Gly
660 665 670

Leu Leu Asn Lys Cys Glu Arg Ile Gln Gly Arg Lys Ala Ser Leu Glu
675 680 685

Glu Ile Gln Leu Val His Ser Glu His His Ser Leu Leu Tyr Gly Thr
690 695 700

Asn Pro Leu Asp Gly Gln Lys Leu Asp Pro Arg Ile Leu Leu Gly Asp
705 710 715 720

Asp Ser Gln Lys Phe Phe Ser Ser Leu Pro Cys Gly Gly Leu Gly Val
725 730 735

Asp Ser Asp Thr Ile Trp Asn Glu Leu His Ser Ser Gly Ala Ala Arg
740 745 750

Met Ala Val Gly Cys Val Ile Glu Leu Ala Ser Lys Val Ala Ser Gly
755 760 765

Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro Gly His His Ala
770 775 780

Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn Ser Val Ala Ile
785 790 795 800

Thr Ala Lys Tyr Leu Arg Asp Gln Leu Asn Ile Ser Lys Ile Leu Ile
805 810 815

Val Asp Leu Asp Val His His Gly Asn Gly Thr Gln Gln Ala Phe Tyr
820 825 830

Ala Asp Pro Ser Ile Leu Tyr Ile Ser Leu His Arg Tyr Asp Glu Gly
835 840 845

Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val Arg Phe Ile Ser
850 855 860

Leu Glu Pro His Phe Tyr Leu Tyr Leu Ser Gly Asn Cys Ile Ala
865 870 875



<210> 5 

<211> 3054 

<212> DNA

<213> Homo sapiens 

<400> 5 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800 
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860 
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920 
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980 
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100 
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160 
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220 
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280 
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340 
gctgttgtga ggccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400 
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460 
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520 
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580 
ggagccccaa atgaggttgg aacaggcctt ggagaagggt acaatataaa tattgcctgg 2640 
acaggtggcc ttgatcctcc catgggagat gttgagtacc ttgaagcatt caggaccatc 2700 
gtgaagcctg tggccaaaga gtttgatcca gacatggtct tagtatctgc tggatttgat 2760 
gcattggaag gccacacccc tcctctagga gggtacaaag tgacggcaaa atgttttggt 2820 
catttgacga agcaattgat gacattggct gatggacgtg tggtgttggc tctagaagga 2880 
ggacatgatc tcacagccat ctgtgatgca tcagaagcct gtgtaaatgc ccttctagga 2940 
aatgagctgg agccacttgc agaagatatt ctccaccaaa gcccgaatat gaatgctgtt 3000 
atttctttac agaagatcat tgaaattcaa agtatgtctt taaagttctc ttaa 3054 

<210> 6 

<211> 967 

<212> PRT 

<213> Homo sapiens 

<400> 6 



Met His Ser Met 


lie 


Ser 


Ser 


Val 


Asp Val 


Lys 


Ser 


Glu 


Val 


Pro Val 


1 


5 








10 










15 


Gly Leu Glu Pro 


He 


Ser 


Pro 


Leu 


Asp Leu 


Arg 


Thr 


Asp 


Leu 


Arg Met 


20 










25 








30 


Met Met Pro Val 


Val 


Asp 


Pro 


Val 


Val Arg 


Glu 


Lys 


Gin 


Leu 


Gin Gin 


35 








40 








45 






Glu Leu Leu Leu 


He 


Gin 


Gin 


Gin 


Gin Gin 


He 


Gin 


Lys 


Gin 


Leu Leu 


50 






55 








60 






He Ala Glu Phe 


Gin 


Lys 


Gin 


His 


Glu Asn 


Leu 


Thr 


Arg 


Gin 


His Gin 


65 




70 








75 






80 


Ala Gin Leu Gin 


Glu 


His 


He 


Lys 


Glu Leu 


Leu 


Ala 


He 


Lys 


Gin Gin 


Gin Glu Leu Leu 


85 








90 








95 


Glu 


Lys 


Glu 


Gin 


Lys Leu 


Glu 


Gin 


Gin 


Arg Gin Glu 


100 










105 








110 




Gin Glu Val Glu 


Arg 


His 


Arg 


Arg 


Glu Gin 


Gin 


Leu 


Pro 


Pro 


Leu Arg 


115 








120 








125 




Gly Lys Asp Arg 


Gly 


Arg 


Glu 


Arg 


Ala Val 


Ala 


Ser 


Thr 


Glu 


Val Lys 


130 






135 








140 






Gin Lys Leu Gin 


Glu 


Phe 


Leu 


Leu 


Ser Lys 


Ser 


Ala 


Thr 


Lys Asp Thr 


145 




150 








155 








160 


Pro Thr Asn Gly 


Lys 


Asn 


His 


Ser 


Val Ser 


Arg His 


Pro 


Lys 


Leu Trp 




165 








170 








175 


Tyr Thr Ala Ala 


His 


His 


Thr 


Ser 


Leu Asp 


Gin 


Ser 


Ser 


Pro 


Pro Leu 


180 










185 








190 
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Ser 


biy 


Thr 


Ser 


Pro 


Ser 


Tyr 


Lys 






1 y D 










200 




Lys 


Asp 


Asp 


Phe 


Pro 


Leu Arg 




ziu 










215 




Ser 


Ser 


Pro 


Gly Ser Gly 


Pro 


Ser 












230 






Ser 


vai 


inr 


Glu Asn 


Glu 


Thr 


Ser 










245 








bJ.U 


bin 


Jxiec 


Val 


Ser 


Gin 


Gin Arg 








260 










Asn 


Leu 


Leu 


Ser 


Leu Tyr 


Thr 


Ser 






I ID 










280 


biy 


Leu 


Pro 


Ala 


Val 


Pro 


Ser 


Gin 




ion 










295 




Glu 


Lys 


Gin 


Lys Cys 


Glu 


Thr Gin 


"3 A C 

3 05 










310 






Pro 


Gly 


Gin 


Tyr Gly Gly 


Ser 


He 










325 








Val 


Thr 


Leu 


Glu Gly Lys 


Pro 


Pro 








340 










Gin 


His 


Leu 


Leu 


Leu 


Lys 


Glu Gin 






355 










360 


Ala 


Gly 


Gly 


Val 


Pro 


Leu 


His 


Pro 




370 










375 




Arg 


lie 


Ser 


Pro Gly 


He 


Arg Gly 


385 










390 






Pro 


Leu 


Asn 


Arg 


Thr 


Gin 


Ser 


Ala 










405 








Gin 


Leu 


Val 


lie 


Gin 


Gin 


Gin 


His 








420 










Gin 


Tyr 


Gin 


Gin 


Gin 


He 


His 


Met 






435 










440 


Glu 


Gin 


Leu 


Lys 


Gin 


Pro 


Gly 


Ser 




450 










455 




Leu 


Gin 


Gly 


Asp 


Gin 


Ala 


Met 


Gin 


A C IT 










470 






Asn 


Ser 


Thr 


Arg 


Ser Asp 


Ser 


Ser 










485 








pi « 

Gin 


vai 


PI , - 

Gly 


Ala 


Val 


Lys 


Val 


Lys 








500 










Asp 


T\ "1 _ 

Ala 


Gin 


He 


Gin 


Glu 


Met 


Glu 






Dlb 










520 


bin 


p i ~ 
bin 


Pro 


Phe 


Leu 


Glu 


Pro 


Thr 














535 




pi ^ 

Gin 


Ala 


Pro 


Leu 


Ala 


Ala 


Val Gly 












550 






Leu 


Val 




Arg Thr His 


Ser 


Ser 










565 








Pro 


AT 0 




Asp Arg 


Pro 


Leu 


Gin 








580 










Tyr 


Asp 


Pro 


Leu 


Met 


Leu 


Lys 


His 






595 










600 


Thr 


His 


Pro 


Glu His 


Ala 


Gly Arg 




610 










615 




Gin 


Glu 


Thr 


Gly Leu Leu 


Asn Lys 


625 










630 






Ala 


Ser 


Leu 


Glu 


Glu 


He 


Gin 


Leu 










645 








Leu 


Tyr 


Gly 


Thr 


Asn 


Pro 


Leu 


Asp 








660 










Leu 


Leu 


Gly 


Asp Asp 


Ser 


Gin Lys 






675 










680 
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Thr 


Leu 


Pro 




Ala 


Gin 


Asp 










205 








Lys 


Thr 


Glu 


Ser 


Ser 


Val 


Ser 


Ser 








220 










iJCi. 


Pro 


Asn 


Asn 




Pro 


Thr 








235 










240 


Val 


Leu 


Pro 


Pro 


Thr 


Pro 


His 


Ala 




£» J vj 










255 






TiPll 


He 


His 


Glu 




OCl 


Met 


^ U J 
















XT i U 


Qoy 


UcU 


XT J. KJ 


rVsn 


lie 


X ilX, 


UCU. 










0 ~J 








- 

Leu 


Asn 


nld 


OCX. 


noli 




UCU 










■3 no. 










xnr 


Leu 


. 

Arg 


PI 71 
VjjJ.Il 


r*T it- 
vjj.y 


val 


Pro 


Leu 






Jlj 










•J c, \J 


Pro 


Ala 


Ser 


Qor- 


bsr 


IT. IS 


Pro 


nib 




^ A 










j j j 




Asn 


Ser 


Ser 


nis 


pi „ 
o±n 


A±a 


Leu 


Leu 


•3 / c 










OCA 






TUT „ +- 

Met 


Arg 


pi — 
bin 


bin 


Lys 


Leu 


Leu 


vai 










jOD 








p i _ 
bin 


Ser 


Pro 


Leu 


Ala 


inr 


Lys 


VjIU 








ion 










inr 


WIS 


Lys 


Leu 


Pro 


Arg 


riliS 


Arg 
















Ann 


Pro 


Leu 


Pro 


ply-, 

bin 


Ser 


Thr 


Leu 


rixa 




/in 
41U 










/i 1 ^ 

4±1D 




Gin 


Gin 


Phe 


Leu 


pi it 

blU 


Lys 


pi _ 

bin 


Lys 


425 










/ion 






Asn 


Lys 


Leu 


Leu 


Ser 


Lys 


Ser 


Tift 

lie 










AA c: 








His 


Leu 




J.U 


nla 


pi , n 

ol U. 


Pi n 

Ul U. 


Pi n 
uiu 


















Glu 


Asp 


Arg 


Aia 


Pro 


Ser 


Ser 








Alt* 










Ann 


Ala 


Cys 


Val 


Asp 


Asp 


1 IlX 


Leu 


Pi -t 1- 

\3 ±.y 




490 














Glu 


Glu 


Pro 


val 


A an 
nop 


Del 




P"| -I} 


505 










sin 






Ser Gly 


pi 


pi r, 
bin 


A 1 =s 

nla 


AT a 


xrXlti 


Mph 
1*1 U 










j« j 








His 


Thr 






UCU 


Cpr 


V ax 


**i y 


















Met Asp 


Pi v 




P*l 11 

UJ. U. 


Lys 


His 








jjj 










560 


Pro 


Ala 


Ala 


OCX. 


Val 


Leu 


Pro 


His 




570 










575 




Pro Gly 


Ser 


Ala 


Thr 


Glv 


He 


Ala 


585 










590 






Gin Cys 


Val 


Cys 


Gly 


Asn 


Ser 


Thr 










605 








He 


Gin 


Ser 


He' 


Trp 


Ser 


Arg 


Leu 








620 










Cys 


Glu 


Arg 


He 


Gin 


Gly 


Arg 


Lys 






635 










640 


Val 


His 


Ser 


Glu 


His 


His 


Ser 


Leu 




650 










655 




Gly Gin 


Lys 


Leu 


Asp 


Pro 


Arg 


He 


665 










670 






Phe 


Phe 


Ser 


Ser 


Leu 


Pro 


Cys 


Gly 



685 
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Leu 


Gly 


Val 


Asp Ser Asp Thr Ile Trp 


Asn Glu Leu 


His 


Ser Ser 




690 






695 


700 






Gly 


Ala 


Ala 


Arg 


Met Ala Val Gly Cys Val 


Ile Glu Leu 


Ala 


Ser Lys 


705 








710 


715 




720 


Val 


Ala 


Ser 


Gly 


Glu Leu Lys Asn Gly Phe 


Ala Val Val 


Arg 


Pro Pro 










725 730 






735 


Gly 


His 


His 


Ala 


Glu Glu Ser Thr Ala Met 


Gly Phe Cys 


Phe 


Phe Asn 








740 


745 




750 




Ser 


Val 


Ala 


He 


Thr Ala Lys Tyr Leu Arg 


Asp Gln Leu 


Asn 


Ile Ser 






755 




760 


765 






Lys 


He 


Leu 


He 


Val Asp Leu Asp Val His 


His Gly Asn 


Gly Thr Gln 




770 






775 


780 






Gin 


Ala 


Phe 


Tyr 


Ala Asp Pro Ser Ile Leu 


Tyr Ile Ser 


Leu 


His Arg 


785 








790 


795 




800 


Tyr 


Asp 


Glu 


Gly 


Asn Phe Phe Pro Gly Ser 


Gly Ala Pro 


Asn 


Glu Val 










805 810 






815 


Gly 


Thr 


Gly 


Leu 


Gly Glu Gly Tyr Asn Ile 


Asn Ile Ala 


Trp 


Thr Gly 








820 


825 




830 




Gly 


Leu 


Asp 


Pro 


Pro Met Gly Asp Val Glu 


Tyr Leu Glu 


Ala 


Phe Arg 






835 




840 


845 






Thr 


He 


Val 


Lys 


Pro Val Ala Lys Glu Phe 


Asp Pro Asp 


Met 


Val Leu 




850 






855 


860 






Val 


Ser 


Ala 


Gly 


Phe Asp Ala Leu Glu Gly 


His Thr Pro 


Pro Leu Gly 


865 








870 


875 




880 


Gly 


Tyr 


Lys 


Val 


Thr Ala Lys Cys Phe Gly 


His Leu Thr 


Lys 


Gln Leu 










885 890 






895 


Met 


Thr 


Leu 


Ala 


Asp Gly Arg Val Val Leu 


Ala Leu Glu 


Gly Gly His 








900 


905 




910 




Asp 


Leu 


Thr 


Ala 


Ile Cys Asp Ala Ser Glu 


Ala Cys Val 


Asn 


Ala Leu 






915 




920 


925 






Leu 


Gly 


Asn 


Glu 


Leu Glu Pro Leu Ala Glu 


Asp Ile Leu 


His 


Gln Ser 




930 






935 


940 






Pro 


Asn 


Met 


Asn 


Ala Val Ile Ser Leu Gln 


Lys Ile Ile 


Glu 


Ile Gln 


945 








950 


955 




960 


Ser 


Met 


Ser 


Leu 


Lys Phe Ser 









965 



<210> 7 

<211> 3367 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840 
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900 
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960 
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020 
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080 
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acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140 
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800 
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860 
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920 
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980 
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040 
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100 
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160 
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220 
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280 
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340 
gctgttgtga ggccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400 
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460 
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520 
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580 
ggagccccaa atgaggttcg gtttatttct ttagagcccc acttttattt gtatctttca 2640 
ggtaattgca ttgcatgatt acccctaatt ttcttgtcct ttgctggtgt tttaaattac 2700 
acgagattac tgaattgtcc catgggacca agaaccagtg cagaacaagt gcataaccca 2760 
gagcactgtt tgtcagggaa ggttgggctg atttgatgtg ttgtttgatg tttatttcaa 2820 
gagctcccat gtgcttgttt tcctctcttc ttgctttctt ccatttgctc tcttctctgc 2880 
ccaccgtggt gtgtctttct cttcccaggt tggaacaggc cttggagaag ggtacaatat 2940 
aaatattgcc tggacaggtg gccttgatcc tcccatggga gatgttgagt accttgaagc 3000 
attcaggacc atcgtgaagc ctgtggccaa agagtttgat ccagacatgg tcttagtatc 3060 
tgctggattt gatgcattgg aaggccacac ccctcctcta ggagggtaca aagtgacggc 3120 
aaaatgtttt ggtcatttga cgaagcaatt gatgacattg gctgatggac gtgtggtgtt 3180 
ggctctagaa ggaggacatg atctcacagc catctgtgat gcatcagaag cctgtgtaaa 3240 
tgcccttcta ggaaatgagc tggagccact tgcagaagat attctccacc aaagcccgaa 3300 
tatgaatgct gttatttctt tacagaagat cattgaaatt caaagtatgt ctttaaagtt 3360 
ctcttaa 3367 

<210> 8 
<211> 835 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val 

1 5 10 15 

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met 

20 25 30 

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln 

35 40 45 

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu 

50 55 60 

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln 
65 70 75 80 

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln 

85 90 95 

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu 

100 105 110 

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg 
115 120 125 
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Lys 


Asp 


Arg Gly Arg Glu Arg 


Ala 


Val 


Ala 


Ser Thr 


Glu Val 


Lys 








1 J J 








140 










Lys 


Leu 


Gln Glu Phe Leu Leu 


Ser 


Lys 


Ser 


Ala Thr 


Lys Asp 


Thr 








1 CA 






155 








160 


Pro 


Thr 


Asn 


Gly Lys Asn His Ser 


Val 


Ser 


Arg His Pro 


Lys Leu Trp 








loo 




1 m 
1 / u 








175 




Tyr 


Thr 


Ala 


Ala His His Thr Ser 


Leu 


Asp 


Gin 


Ser Ser 


Pro 


Pro 


Leu 








1 O A 

loU 


lOD 








190 






Ser 


Gly 


Thr 


Ser Pro Ser Tyr Lys 


Tyr 


Thr 


Leu 


Pro Gly 


Ala 


Gin Asp 






195 










205 








Ala 


Lys 


Asp 


Asp Phe Pro Leu Arg 


Lys 


Thr 


Glu 


Ser Ser 


Val 


Ser 


Ser 








ZlO 








220 








Ser 


Ser 


Pro 


Gly Ser Gly Pro Ser 


Ser 


Pro 


Asn Asn Gly 


Pro 


Thr Gly 


225 












235 








240 


Ser 


Val 


Thr 


Glu Asn Glu Thr Ser 


Val 


Leu 


Pro 


Pro Thr 


Pro 


His 


Ala 








245 




250 








255 




Glu 


Gin 


Met 


Val Ser Gln Gln Arg 


He 


Leu 


He 


His Glu 


Asp 


Ser 


Met 








260 


265 








270 






Asn 


Leu 


Leu 


Ser Leu Tyr Thr Ser 


Pro 


Ser 


Leu 


Pro Asn 


He 


Thr 


Leu 






275 


280 








285 








Gly 


Leu 


Pro 


Ala Val Pro Ser Gln 


Leu 


Asn 


Ala 


Ser Asn 


Ser 


Leu 


Lys 




290 




295 








300 








Glu 


Lys 


Gin 


Lys Cys Glu Thr Gln 


Thr 


Leu 


Arg Gln Gly 


Val 


Pro 


Leu 


305 






310 






315 








320 


Pro 


Gly 


Gin 


Tyr Gly Gly Ser Ile 


Pro 


Ala 


Ser 


Ser Ser 


His 


Pro 


His 








325 




330 








335 




Val 


Thr 


Leu 


Glu Gly Lys Pro Pro 


Asn 


Ser 


Ser 


His Gin 


Ala 


Leu 


Leu 








340 


345 








350 






Gin 


His 


Leu 


Leu Leu Lys Glu Gin 


Met 


Arg 


Gin Gin Lys 


Leu 


Leu 


Val 






355 


360 








365 








Ala 


Gly 


Gly 


Val Pro Leu His Pro 


Gin 


Ser 


Pro 


Leu Ala 


Thr 


Lys 


Glu 




370 




375 








380 








Arg 


lie 


Ser 


Pro Gly He Arg Gly 


Thr 


His 


Lys 


Leu Pro 


Arg 


His 


Arg 


Tor 

385 






390 






395 








400 


Pro 


Leu 


Asn 


Arg Thr Gin Ser Ala 


Pro 


Leu 


Pro 


Gin Ser 


Thr 


Leu 


Ala 








405 




410 








415 




Gin 


Leu 


Val 


He Gin Gin Gin His 


Gin 


Gin 


Phe 


Leu Glu 


Lys 


Gin 


Lys 








420 


425 








430 






Gin 


Tyr 


Gin 


Gin Gin He His Met 


Asn 


Lys 


Leu 


Leu Ser 


Lys 


Ser 


He 






435 


440 








445 








Glu 


Gin 


Leu 


Lys Gin Pro Gly Ser 


His 


Leu 


Glu 


Glu Ala 


Glu 


Glu 


Glu 




A c a 
450 




455 








460 








Leu 


Gin 


Gly 


Asp Gin Ala Met Gin 


Glu 


Asp 


Arg 


Ala Pro 


Ser 


Ser 


Gly 


A 4Z. fZ 

465 






470 






475 








480 


Asn 


Ser 


Thr 


Arg Ser Asp Ser Ser 


Ala 


Cys 


Val 


Asp Asp 


Thr 


Leu 


Gly 








485 




A O A 

4yu 








495 




Gin 


Vai 


Gly 


Ala Val Lys Val Lys 


GlU 


GlU 


Pro 


Val Asp 


Ser 


Asp 


Glu 








c a a 


DUO 








510 






ASp 


Aia 


Gin 


He Gin Glu Met Glu 


Ser 


Gly 


Glu 


Gin Ala 


Ala 


Phe 


Met 






CI c 










525 








tjJLn 


(jin 


Pro 


Phe Leu Glu Pro Thr 


rllS 


inr 


Arg Ala Leu 


Ser Val 


Arg 




530 




535 








540 








Gin 


Ala 


Pro 


Leu Ala Ala Val Gly 


Met 


Asp 


Gly Leu Glu 


Lys 


His 


Arg 


545 






550 






555 








560 


Leu 


Val 


Ser 


Arg Thr His Ser Ser 


Pro 


Ala 


Ala 


Ser Val 


Leu 


Pro 


His 








565 




570 








575 




Pro 


Ala 


Met 


Asp Arg Pro Leu Gin 


Pro 


Gly 


Ser Ala Thr 


Gly He Ala 








580 


585 








590 






Tyr 


Asp 


Pro 


Leu Met Leu Lys His 


Gin 


Cys 


Val Cys Gly 


Asn 


Ser 


Thr 






595 


600 








605 








Thr 


His 


Pro 


Glu His Ala Gly Arg 


He 


Gin 


Ser 


He Trp 


Ser Arg Leu 




610 




615 








620 
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Gin 


Glu 


Thr 


Gly Leu 


Leu Asn Lys Cys 


Glu 


Arg 


He Gin Gly Arg Lys 


625 








630 




635 


640 


Ala 


Ser 


Leu. 


Glu Glu 


He Gin Leu Val 


His 


Ser 


Glu His His Ser Leu 








645 




650 




655 


Leu 


Tvr 


Gly 


Thr Asn 


Pro Leu Asn Glv 


Gin 


Lys 


Leu Asp Pro Arg lie 








660 


665 






670 


L6U 


Leu 


Glv 

\3J~X 


Asn Asr> 


Ser Gin Lys Phe 


Phe 


Ser 


Ser Leu Pro Cys Gly 






675 




680 






685 


Glv 


LeU 


Glv 




Qci-r Acvn Thr Tie 


Tm 


Asn 


Glu TiPU Hie; Ser Ser 




690 






695 

u 






700 


Glv 


Ala 


Ala 


Arcr Met" 


Ala Val Glv Cvs 


Val 


He 


Glu Leu Ala Ser Lvs 


705 








71 0 




715 


720 


Val 


Ala 




Glv Glu 


Leu TtV^ A<5n Glv 


Phe 


Ala 


Val Val Ara Pro Pro 








/ 6 J 




7*30 




735 




His 


nib 


Jla nil? 


I^JT 7 > Q&y r pVi y» Ala 

WXU OCX -L IX -L rtXU 




Glv 


Phe rvt; php Phe A^n 

flic jf O n i ic xriic rion 










745 






750 


Ser 


Val 


Ala 


lie Thr 


Ala Lys Tyr Leu 


Arg 


Asp 


Gin Leu Asn He Ser 






755 




760 






765 


Lys 


He 


Leu 


He Val 


Asp Leu Asp Val 


His 


His 


Gly Asn Gly Thr Gin 




770 






775 






780 


Gin 


Ala 


Phe 


Tyr Ala 


Asp Pro Ser He 


Leu 


Tyr 


He Ser Leu His Arg 


785 








790 




795 


800 


Tyr 


Asp 


Glu 


Gly Asn 


Phe Phe Pro Gly 


Ser 


Gly 


Ala Pro Asn Glu Val 








805 




810 




815 


Arg 


Phe 


He 


Ser Leu 


Glu Pro His Phe 


Tyr 


Leu 


Tyr Leu Ser Gly Asn 








820 


825 






830 


Cys 


He 


Ala 
















835 













<210> 9 

<211> 1791 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840 
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900 
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960 
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020 
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080 
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140 
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
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atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
gtaataggca aagatttagc tccaggattt gtaattaaag tcattatctg a 1791 

<210> 10 

<211> 546 

<212> PRT 

<213> Homo sapiens 



<400> 10 



Met 


His 


Ser 


Met 


lie 


Ser 


Ser 


Val 


Asp 


Val 


Lys 


Ser 


Glu 


Val 


Pro 


Val 


l 








b 










10 










15 




Gly 


Leu 


Glu 


Pro 


lie 


Ser 


Pro 


Leu 


Asp 


Leu 


Arg 


Thr 


Asp 


Leu 


Arg 


Met 








20 










25 










30 






Met 


Met 


Pro 


Val 


Val 


Asp 


Pro 


Val 


Val 


Arg 


Glu 


Lys 


Gin 


Leu 


Gin 


Gin 






35 










40 










45 
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900 905 910 

Thr Val Val Met Pro Ile Ala Ser Glu Phe Ala Pro Asp Val Val Leu 

915 920 925 

Val Ser Ser Gly Phe Asp Ala Val Glu Gly His Pro Thr Pro Leu Gly 

930 935 940 

Gly Tyr Asn Leu Ser Ala Arg Cys Phe Gly Tyr Leu Thr Lys Gln Leu 
945 950 955 960 

Met Gly Leu Ala Gly Gly Arg Ile Val Leu Ala Leu Glu Gly Gly His 

965 970 975 

Asp Leu Thr Ala Ile Cys Asp Ala Ser Glu Ala Cys Val Ser Ala Leu 

980 985 990 

Leu Gly Asn Glu Leu Asp Pro Leu Pro Glu Lys Val Leu Gln Gln Arg 

995 1000 1005 

Pro Asn Ala Asn Ala Val Arg Ser Met Glu Lys Val Met Glu Ile His 

1010 1015 1020 

Ser Lys Tyr Trp Arg Cys Leu Gln Arg Thr Thr Ser Thr Ala Gly Arg 
1025 1030 1035 1040 

Ser Leu Ile Glu Ala Gln Thr Cys Glu Asn Glu Glu Ala Glu Thr Val 

1045 1050 1055 

Thr Ala Met Ala Ser Leu Ser Val Gly Val Lys Pro Ala Glu Lys Arg 

1060 1065 1070 

Pro Asp Glu Glu Pro Met Glu Glu Glu Pro Pro Leu 
1075 1080 



<210> 13 
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<400> 13 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgcctct gagcccaact tgaaggtgcg gtccaggtta 840 
aaacagaaag tggcagagag gagaagcagc cccttactca ggcggaagga tggaaatgtt 900 
gtcacttcat tcaagaagcg aatgtttgag gtgacagaat cctcagtcag tagcagttct 960 
ccaggctctg gtcccagttc accaaacaat gggccaactg gaagtgttac tgaaaatgag 1020 
acttcggttt tgccccctac ccctcatgcc gagcaaatgg tttcacagca acgcattcta 1080 
attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140 
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200 
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260 
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320 
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380 
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440 
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500 
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560 
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620 
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680 
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740 
agcagtgctt gtgtggatga cacactggga caagttgggg ctgtgaaggt caaggaggaa 1800 
ccagtggaca gtgatgaaga tgctcagatc caggaaatgg aatctgggga gcaggctgct 1860 
tttatgcaac aggtaatagg caaagattta gctccaggat ttgtaattaa agtcattatc 1920 
tgacctttcc tggaacccac gcacacacgt gcgctctctg tgcgccaagc tccgctggct 1980 
gcggttggca tggatggatt agagaaacac cgtctcgtct ccaggactca ctcttcccct 2040 
gctgcctctg ttttacctca cccagcaatg gaccgccccc tccagcctgg ctctgcaact 2100 
ggaattgcct atgacccctt gatgctgaaa caccagtgcg tttgtggcaa ttccaccacc 2160 
caccctgagc atgctggacg aatacagagt atctggtcac gactgcaaga aactgggctg 2220 
ctaaataaat gtgagcgaat tcaaggtcga aaagccagcc tggaggaaat acagcttgtt 2280 
cattctgaac atcactcact gttgtatggc accaaccccc tggacggaca gaagctggac 2340 
cccaggatac tcctaggtga tgactctcaa aagttttttt cctcattacc ttgtggtgga 2400 
cttggggtgg acagtgacac catttggaat gagctacact cgtccggtgc tgcacgcatg 2460 
gctgttggct gtgtcatcga gctggcttcc aaagtggcct caggagagct gaagaatggg 2520 
tttgctgttg tgaggccccc tggccatcac gctgaagaat ccacagccat ggggttctgc 2580 
ttttttaatt cagttgcaat taccgccaaa tacttgagag accaactaaa tataagcaag 2640 
atattgattg tagatctgga tgttcaccat ggaaacggta cccagcaggc cttttatgct 2700 
gaccccagca tcctgtacat ttcactccat cgctatgatg aagggaactt tttccctggc 2760 
agtggagccc caaatgaggt tcggtttatt tctttagagc cccactttta tttgtatctt 2820 
tcaggtaatt gcattgcatg attaccccta attttcttgt cctttgctgg tgttttaaat 2880 
tacacgagat tactgaattg tcccatggga ccaagaacca gtgcagaaca agtgcataac 2940 
ccagagcact gtttgtcagg gaaggttggg ctgatttgat gtgttgtttg atgtttattt 3000 
caagagctcc catgtgcttg ttttcctctc ttcttgcttt cttccatttg ctctcttctc 3060 
tgcccaccgt ggtgtgtctt tctcttccca ggttggaaca ggccttggag aagggtacaa 3120 
tataaatatt gcctggacag gtggccttga tcctcccatg ggagatgttg agtaccttga 3180 
agcattcagg accatcgtga agcctgtggc caaagagttt gatccagaca tggtcttagt 3240 
atctgctgga tttgatgcat tggaaggcca cacccctcct ctaggagggt acaaagtgac 3300 
ggcaaaatgt tttggtcatt tgacgaagca attgatgaca ttggctgatg gacgtgtggt 3360 
gttggctcta gaaggaggac atgatctcac agccatctgt gatgcatcag aagcctgtgt 3420 
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aaatgccctt ctaggaaatg agctggagcc acttgcagaa gatattctcc accaaagccc 3480 
gaatatgaat gctgttattt ctttacagaa gatcattgaa attcaaagta tgtctttaaa 3540 
gttctcttaa 3550 

<210> 14 

<211> 7699 

<212> DNA 

<213> Homo sapiens 



<400> 14 

cccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60 
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccca 120 
gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctgat ctaatcaata 180 
ttggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 240 
attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 300 
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 360 
agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 420 
cgaccgccca gcgacccccg cccgttgacg tcaatagtga cgtatgttcc catagtaacg 480 
ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg 540 
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacggtaaa 600 
tggcccgcct agcattatgc ccagtacatg accttacggg agtttcctac ttggcagtac 660 
atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta caccaatggg 720 
cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg 780 
agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaataa ccccgccccg 840 
ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcgttta 900 
gtgaaccgtc agaattcaag cttgcggccg cagatctatc gatctgcagg atatcaccat 960 
gcacagtatg atcagctcag tggatgtgaa gtcagaagtt cctgtgggcc tggagcccat 1020 
ctcaccttta gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttgt 1080 
ccgtgagaag caattgcagc aggaattact tcttatccag cagcagcaac aaatccagaa 1140 
gcagcttctg atagcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200 
tcagcttcag gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260 
aaaggagcag aaactggagc agcagaggca agaacaggaa gtagagaggc atcgcagaga 1320 
acagcagctt cctcctctca gaggcaaaga tagaggacga gaaagggcag tggcaagtac 1380 
agaagtaaag cagaagcttc aagagttcct actgagtaaa tcagcaacga aagacactcc 1440 
aactaatgga aaaaatcatt ccgtgagccg ccatcccaag ctctggtaca cggctgccca 1500 
ccacacatca ttggatcaaa gctctccacc ccttagtgga acatctccat cctacaagta 1560 
cacattacca ggagcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctga 1620 
gcccaacttg aaggtgcggt ccaggttaaa acagaaagtg gcagagagga gaagcagccc 1680 
cttactcagg cggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaggt 1740 
gacagaatcc tcagtcagta gcagttctcc aggctctggt cccagttcac caaacaatgg 1800 
gccaactgga agtgttactg aaaatgagac ttcggttttg ccccctaccc ctcatgccga 1860 
gcaaatggtt tcacagcaac gcattctaat tcatgaagat tccatgaacc tgctaagtct 1920 
ttatacctct ccttctttgc ccaacattac cttggggctt cccgcagtgc catcccagct 1980 
caatgcttcg aattcactca aagaaaagca gaagtgtgag acgcagacgc ttaggcaagg 2040 
tgttcctctg cctgggcagt atggaggcag catcccggca tcttccagcc accctcatgt 2100 
tactttagag ggaaagccac ccaacagcag ccaccaggct ctcctgcagc atttattatt 2160 
gaaagaacaa atgcgacagc aaaagcttct tgtagctggt ggagttccct tacatcctca 2220 
gtctcccttg gcaacaaaag agagaatttc acctggcatt agaggtaccc acaaattgcc 2280 
ccgtcacaga cccctgaacc gaacccagtc tgcacctttg cctcagagca cgttggctca 2340 
gctggtcatt caacagcaac accagcaatt cttggagaag cagaagcaat accagcagca 2400 
gatccacatg aacaaactgc tttcgaaatc tattgaacaa ctgaagcaac caggcagtca 2460 
ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg atgcaggaag acagagcgcc 2520 
ctctagtggc aacagcacta ggagcgacag cagtgcttgt gtggatgaca cactgggaca 2580 
agttggggct gtgaaggtca aggaggaacc agtggacagt gatgaagatg ctcagatcca 2640 
ggaaatggaa tctggggagc aggctgcttt tatgcaacag cctttcctgg aacccacgca 2700 
cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg gttggcatgg atggattaga 2760 
gaaacaccgt ctcgtctcca ggactcactc ttcccctgct gcctctgttt tacctcaccc 2820 
agcaatggac cgccccctcc agcctggctc tgcaactgga attgcctatg accccttgat 2880 
gctgaaacac cagtgcgttt gtggcaattc caccacccac cctgagcatg ctggacgaat 2940 
acagagtatc tggtcacgac tgcaagaaac tgggctgcta aataaatgtg agcgaattca 3000 
aggtcgaaaa gccagcctgg aggaaataca gcttgttcat tctgaacatc actcactgtt 3060 
gtatggcacc aaccccctgg acggacagaa gctggacccc aggatactcc taggtgatga 3120 
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ctctcaaaag tttttttcct cattaccttg 
ttggaatgag ctacactcgt ccggtgctgc 
ggcttccaaa gtggcctcag gagagctgaa 
ccatcacgct gaagaatcca cagccatggg 
cgccaaatac ttgagagacc aactaaatat 
tcaccatgga aacggtaccc agcaggcctt 
actccatcgc tatgatgaag ggaacttttt 
aacaggcctt ggagaagggt acaatataaa 
catgggagat gttgagtacc ttgaagcatt 
gtttgatcca gacatggtct tagtatctgc 
tcctctagga gggtacaaag tgacggcaaa 
gacattggct gatggacgtg tggtgttggc 
ctgtgatgca tcagaagcct gtgtaaatgc 
agaagatatt ctccaccaaa gcccgaatat 
tgaaattcaa agtatgtctt taaagttctc 
atgacaagta gatcccgggt ggcatccctg 
ggaagttgcc actccagtgc ctfaccagcct 
gtctgactag gtgtcctcta taatattatg 
cccaagttgg gaagacaacc tgtagggcct 
gcagtggcac aatcttggct cactgcaatc 
cctcagcctc ccgagttgtt gggattccag 
tttttttggt agagacgggg tttcaccata 
caggtgatct acccaccttg gcctcccaaa 
cccttccctg tccttctgat tttaaaataa 
cataggctac ctgccatggc ccaaccggtg 
ctctcatgcg ttgggtccac tcagtagatg 
gtggaatgtg tgtcagttag ggtgtggaaa 
gcaaagcatg catctcaatt agtcagcaac 
caggcagaag tatgcaaagc atgcatctca 
ctccgcccat cccgccccta actccgccca 
taattttttt tatttatgca gaggccgagg 
agtgaggagg cttttttgga ggcctaggct 
accagaaagt taattcccta tagtgagtcg 
ttcctgtgtg aaattgttat ccgctcacaa 
agtgtaaagc ctggggtgcc taatgagtga 
tgcccgcttt ccagtcggga aacctgtcgt 
cggggagagg cggtttgcgt attgggcgct 
gctcggtcgt tcggctgcgg cgagcggtat 
ccacagaatc aggggataac gcaggaaaga 
ggaaccgtaa aaaggccgcg ttgctggcgt 
atcacaaaaa tcgacgctca agtcagaggt 
aggcgtttcc ccctggaagc tccctcgtgc 
gatacctgtc cgcctttctc ccttcgggaa 
ggtatctcag ttcggtgtag gtcgttcgct 
ttcagcccga ccgctgcgcc ttatccggta 
acgacttatc gccactggca gcagccactg 
gcggtgctac agagttcttg aagtggtggc 
ttggtatctg cgctctgctg aagccagtta 
ccggcaaaca aaccaccgct ggtagcggtg 
gcagaaaaaa aggatctcaa gaagatcctt 
ggaacgaaaa ctcacgttaa gggattttgg 
agatcctttt aaattaaaaa tgaagtttta 
ggtctgacag ttaccaatgc ttaatcagtg 
gttcatccat agttgcctga ctccccgtcg 
catctggccc cagtgctgca atgataccgc 
cagcaataaa ccagccagcc ggaagggccg 
cctccatcca gtctattaat tgttgccggg 
gtttgcgcaa cgttgttgcc attgctacag 
tggcttcatt cagctccggt tcccaacgat 
gcaaaaaagc ggttagctcc ttcggtcctc 
tgttatcact catggttatg gcagcactgc 
gatgcttttc tgtgactggt gagtactcaa 



tggtggactt ggggtggaca gtgacaccat 3180 
acgcatggct gttggctgtg tcatcgagct 3240 
gaatgggttt gctgttgtga ggccccctgg 3300 
gttctgcttt tttaattcag ttgcaattac 3360 
aagcaagata ttgattgtag atctggatgt 3420 
ttatgctgac cccagcatcc tgtacatttc 3480 
ccctggcagt ggagccccaa atgaggttgg 3540 
tattgcctgg acaggtggcc ttgatcctcc 3600 
caggaccatc gtgaagcctg tggccaaaga 3660 
tggatttgat gcattggaag gccacacccc 3720 
atgttttggt catttgacga agcaattgat 3780 
tctagaagga ggacatgatc tcacagccat 3840 
ccttctagga aatgagctgg agccacttgc 3900 
gaatgctgtt atttctttac agaagatcat 3960 
tggatccggt accagattac aaggacgacg 4020 
tgacccctcc ccagtgcctc tcctggcctt 4080 
tgtcctaata aaattaagtt gcatcatttt 4140 
gggtggaggg gggtggtatg gagcaagggg 4200 
gcggggtcta ttcgggaacc aagctggagt 4260 
tccgcctcct gggttcaagc gattctcctg 4320 
gcatgcatga ccaggctcag ctaatttttg 4380 
ttggccaggc tggtctccaa ctcctaatct 4440 
ttgctgggat tacaggcgtg aaccactgct 4500 
ctataccagc aggaggacgt ccagacacag 4560 
ggacatttga gttgcttgct tggcactgtc 4620 
cctgttgaat tgggtacgcg gccagcttct 4680 
gtccccaggc tccccagcag gcagaagtat 4740 
caggtgtgga aaagtcccca ggctccccag 4800 
attagtcagc aaccatagtc ccgcccctaa 4860 
gttccgccca ttctccgccc catggctgac 4920 
ccgcctcggc ctctgagcta ttccagaagt 4980 
tttgcaaaaa gctcctcgag gaactgaaaa 5040 
tattaaattc gtaatcatgg tcatagctgt 5100 
ttccacacaa catacgagcc ggaagcataa 5160 
gctaactcac attaattgcg ttgcgctcac 5220 
gccagctgca ttaatgaatc ggccaacgcg 5280 
cttccgcttc ctcgctcact gactcgctgc 5340 
cagctcactc aaaggcggta atacggttat 5400 
acatgtgagc aaaaggccag caaaaggcca 5460 
ttttccatag gctccgcccc cctgacgagc 5520 
ggcgaaaccc gacaggacta taaagatacc 5580 
gctctcctgt tccgaccctg ccgcttaccg 5640 
gcgtggcgct ttctcaatgc tcacgctgta 5700 
ccaagctggg ctgtgtgcac gaaccccccg 5760 
actatcgtct tgagtccaac ccggtaagac 5820 
gtaacaggat tagcagagcg aggtatgtag 5880 
ctaactacgg ctacactaga agaacagtat 5940 
ccttcggaaa aagagttggt agctcttgat 6000 
gtttttttgt ttgcaagcag cagattacgc 6060 
tgatcttttc tacggggtct gacgctcagt 6120 
tcatgagatt atcaaaaagg atcttcacct 6180 
aatcaatcta aagtatatat gagtaaactt 6240 
aggcacctat ctcagcgatc tgtctatttc 6300 
tgtagataac tacgatacgg gagggcttac 6360 
gagacccacg ctcaccggct ccagatttat 6420 
agcgcagaag tggtcctgca actttatccg 6480 
aagctagagt aagtagttcg ccagttaata 6540 
gcatcgtggt gtcacgctcg tcgtttggta 6600 
caaggcgagt tacatgatcc cccatgttgt 6660 
cgatcgttgt cagaagtaag ttggccgcag 6720 
ataattctct tactgtcatg ccatccgtaa 6780 
ccaagtcatt ctgagaatag tgtatgcggc 6840 
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gaccgagttg ctcttgcccg gcgtcaatac 
taaaagtgct catcattgga aaacgttctt 
tgttgagatc cagttcgatg taacccactc 
ctttcaccag cgtttctggg tgagcaaaaa 
taagggcgac acggaaatgt tgaatactca 
tttatcaggg ttattgtctc atgagcggat 
aaataggggt tccgcgcaca tttccccgaa 
cattaagcgc ggcgggtgtg gtggttacgc 
tagcgcccgc tcctttcgct ttcttccctt 
gtcaagctct aaatcggggc atccctttag 
accccaaaaa acttgattag ggtgatggtt 
tttttcgccc tttgacgttg gagtccacgt 
gaacaacact caaccctatc tcggtctatt 
cggcctattg gttaaaaaat gagctgattt 
tattaaacgt ttacaattt 



gggataatac cgcgccacat agcagaactt 6900 
cggggcgaaa actctcaagg atcttaccgc 6960 
gtgcacccaa ctgatcttca gcatctttta 7020 
caggaaggca aaatgccgca aaaaagggaa 7080 
tactcttcct ttttcaatat tattgaagca 7140 
acatatttga atgtatttag aaaaataaac 7200 
aagtgccacc tgacgcgccc tgtagcggcg 7260 
gcagcgtgac cgctacactt gccagcgccc 7320 
cctttctcgc cacgttcgcc ggctttcccc 7380 
ggttccgatt tagtgcttta cggcacctcg 7440 
cacgtagtgg gccatcgccc tgatagacgg 7500 
tctttaatag tggactcttg ttccaaactg 7560 
cttttgattt ataagggatt ttgccgattt 7620 
aacaaaaatt taacgcgaat tttaacaaaa 7680 

7699 



<210> 15 
<211> 7303 
<212> DNA 

<213> Homo sapiens 
<400> 15 

cccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60 
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccca 120 
gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctgat ctaatcaata 180 
ttggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 240 
attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 300 
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 360 
agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 420 
cgaccgccca gcgacccccg cccgttgacg tcaatagtga cgtatgttcc catagtaacg 480 
ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg 540 
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacggtaaa 600 
tggcccgcct agcattatgc ccagtacatg accttacggg agtttcctac ttggcagtac 660 
atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta caccaatggg 720 
cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg 780 
agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaataa ccccgccccg 840 
ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcgttta 900 
gtgaaccgtc agaattcaag cttgcggccg cagatctatc gatctgcagg atatcaccat 960 
gcacagtatg atcagctcag tggatgtgaa gtcagaagtt cctgtgggcc tggagcccat 1020 
ctcaccttta gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttgt 1080 
ccgtgagaag caattgcagc aggaattact tcttatccag cagcagcaac aaatccagaa 1140 
gcagcttctg atagcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200 
tcagcttcag gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260 
aaaggagcag aaactggagc agcagaggca agaacaggaa gtagagaggc atcgcagaga 1320 
acagcagctt cctcctctca gaggcaaaga tagaggacga gaaagggcag tggcaagtac 1380 
agaagtaaag cagaagcttc aagagttcct actgagtaaa tcagcaacga aagacactcc 1440 
aactaatgga aaaaatcatt ccgtgagccg ccatcccaag ctctggtaca cggctgccca 1500 
ccacacatca ttggatcaaa gctctccacc ccttagtgga acatctccat cctacaagta 1560 
cacattacca ggagcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctga 1620 
gcccaacttg aaggtgcggt ccaggttaaa acagaaagtg gcagagagga gaagcagccc 1680 
cttactcagg cggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaggt 1740 
gacagaatcc tcagtcagta gcagttctcc aggctctggt cccagttcac caaacaatgg 1800 
gccaactgga agtgttactg aaaatgagac ttcggttttg ccccctaccc ctcatgccga 1860 
gcaaatggtt tcacagcaac gcattctaat tcatgaagat tccatgaacc tgctaagtct 1920 
ttatacctct ccttctttgc ccaacattac cttggggctt cccgcagtgc catcccagct 1980 
caatgcttcg aattcactca aagaaaagca gaagtgtgag acgcagacgc ttaggcaagg 2040 
tgttcctctg cctgggcagt atggaggcag catcccggca tcttccagcc accctcatgt 2100 
tactttagag ggaaagccac ccaacagcag ccaccaggct ctcctgcagc atttattatt 2160 
gaaagaacaa atgcgacagc aaaagcttct tgtagctggt ggagttccct tacatcctca 2220 
gtctcccttg gcaacaaaag agagaatttc acctggcatt agaggtaccc acaaattgcc 2280 
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ccgtcacaga cccctgaacc gaacccagtc 
gctggtcatt caacagcaac accagcaatt 
gatccacatg aacaaactgc tttcgaaatc 
ccttgaggaa gcagaggaag agcttcaggg 
ctctagtggc aacagcacta ggagcgacag 
agttggggct gtgaaggtca aggaggaacc 
ggaaatggaa tctggggagc aggctgcttt 
cacacgtgcg ctctctgtgc gccaagctcc 
gaaacaccgt ctcgtctcca ggactcactc 
agcaatggac cgccccctcc agcctggctc 
gctgaaacac cagtgcgttt gtggcaattc 
acagagtatc tggtcacgac tgcaagaaac 
aggtcgaaaa gccagcctgg aggaaataca 
gtatggcacc aaccccctgg acggacagaa 
ctctcaaaag tttttttcct cattaccttg 
ttggaatgag ctacactcgt ccggtgctgc 
ggcttccaaa gtggcctcag gagagctgaa 
ccatcacgct gaagaatcca cagccatggg 
cgccaaatac ttgagagacc aactaaatat 
tcaccatgga aacggtaccc agcaggcctt 
actccatcgc tatgatgaag ggaacttttt 
gtttatttct ttagagcccc acttttattt 
cggtaccaga ttacaaggac gacgatgaca 
ctccccagtg cctctcctgg ccttggaagt 
aataaaatta agttgcatca ttttgtctga 
aggggggtgg tatggagcaa ggggcccaag 
tctattcggg aaccaagctg gagtgcagtg 
tcctgggttc aagcgattct cctgcctcag 
atgaccaggc tcagctaatt tttgtttttt 
aggctggtct ccaactccta atctcaggtg 
ggattacagg cgtgaaccac tgctcccttc 
cagcaggagg acgtccagac acagcatagg 
ttgagttgct tgcttggcac tgtcctctca 
gaattgggta cgcggccagc ttctgtggaa 
aggctcccca gcaggcagaa gtatgcaaag 
tggaaaagtc cccaggctcc ccagcaggca 
cagcaaccat agtcccgccc ctaactccgc 
cccattctcc gccccatggc tgactaattt 
cggcctctga gctattccag aagtagtgag 
aaaagctcct cgaggaactg aaaaaccaga 
attcgtaatc atggtcatag ctgtttcctg 
acaacatacg agccggaagc ataaagtgta 
tcacattaat tgcgttgcgc tcactgcccg 
tgcattaatg aatcggccaa cgcgcgggga 
cttcctcgct cactgactcg ctgcgctcgg 
actcaaaggc ggtaatacgg ttatccacag 
gagcaaaagg ccagcaaaag gccaggaacc 
ataggctccg cccccctgac gagcatcaca 
acccgacagg actataaaga taccaggcgt 
ctgttccgac cctgccgctt accggatacc 
cgctttctca atgctcacgc tgtaggtatc 
tgggctgtgt gcacgaaccc cccgttcagc 
gtcttgagtc caacccggta agacacgact 
ggattagcag agcgaggtat gtaggcggtg 
acggctacac tagaagaaca gtatttggta 
gaaaaagagt tggtagctct tgatccggca 
ttgtttgcaa gcagcagatt acgcgcagaa 
tttctacggg gtctgacgct cagtggaacg 
gattatcaaa aaggatcttc acctagatcc 
tctaaagtat atatgagtaa acttggtctg 
ctatctcagc gatctgtcta tttcgttcat 
taactacgat acgggagggc ttaccatctg 



tgcacctttg cctcagagca cgttggctca 2340 
cttggagaag cagaagcaat accagcagca 2400 
tattgaacaa ctgaagcaac caggcagtca 2460 
ggaccaggcg atgcaggaag acagagcgcc 2520 
cagtgcttgt gtggatgaca cactgggaca 2580 
agtggacagt gatgaagatg ctcagatcca 2640 
tatgcaacag cctttcctgg aacccacgca 2700 
gctggctgcg gttggcatgg atggattaga 2760 
ttcccctgct gcctctgttt tacctcaccc 2820 
tgcaactgga attgcctatg accccttgat 2880 
caccacccac cctgagcatg ctggacgaat 2940 
tgggctgcta aataaatgtg agcgaattca 3000 
gcttgttcat tctgaacatc actcactgtt 3060 
gctggacccc aggatactcc taggtgatga 3120 
tggtggactt ggggtggaca gtgacaccat 3180 
acgcatggct gttggctgtg tcatcgagct 3240 
gaatgggttt gctgttgtga ggccccctgg 3300 
gttctgcttt tttaattcag ttgcaattac 3360 
aagcaagata ttgattgtag atctggatgt 3420 
ttatgctgac cccagcatcc tgtacatttc 3480 
ccctggcagt ggagccccaa atgaggttcg 3540 
gtatctttca ggtaattgca ttgcaggatc 3600 
agtagatccc gggtggcatc cctgtgaccc 3660 
tgccactcca gtgcccacca gccttgtcct 3720 
ctaggtgtcc tctataatat tatggggtgg 3780 
ttgggaagac aacctgtagg gcctgcgggg 3840 
gcacaatctt ggctcactgc aatctccgcc 3900 
cctcccgagt tgttgggatt ccaggcatgc 3960 
tggtagagac ggggtttcac catattggcc 4020 
atctacccac cttggcctcc caaattgctg 4080 
cctgtccttc tgattttaaa ataactatac 4140 
ctacctgcca tggcccaacc ggtgggacat 4200 
tgcgttgggt ccactcagta gatgcctgtt 4260 
tgtgtgtcag ttagggtgtg gaaagtcccc 4320 
catgcatctc aattagtcag caaccaggtg 4380 
gaagtatgca aagcatgcat ctcaattagt 4440 
ccatcccgcc cctaactccg cccagttccg 4500 
tttttattta tgcagaggcc gaggccgcct 4560 
gaggcttttt tggaggccta ggcttttgca 4620 
aagttaattc cctatagtga gtcgtattaa 4680 
tgtgaaattg ttatccgctc acaattccac 4740 
aagcctgggg tgcctaatga gtgagctaac 4800 
ctttccagtc gggaaacctg tcgtgccagc 4860 
gaggcggttt gcgtattggg cgctcttccg 4920 
tcgttcggct gcggcgagcg gtatcagctc 4980 
aatcagggga taacgcagga aagaacatgt 5040 
gtaaaaaggc cgcgttgctg gcgtttttcc 5100 
aaaatcgacg ctcaagtcag aggtggcgaa 5160 
ttccccctgg aagctccctc gtgcgctctc 5220 
tgtccgcctt tctcccttcg ggaagcgtgg 5280 
tcagttcggt gtaggtcgtt cgctccaagc 5340 
ccgaccgctg cgccttatcc ggtaactatc 5400 
tatcgccact ggcagcagcc actggtaaca 5460 
ctacagagtt cttgaagtgg tggcctaact 5520 
tctgcgctct gctgaagcca gttaccttcg 5580 
aacaaaccac cgctggtagc ggtggttttt 5640 
aaaaaggatc tcaagaagat cctttgatct 5700 
aaaactcacg ttaagggatt ttggtcatga 5760 
ttttaaatta aaaatgaagt tttaaatcaa 5820 
acagttacca atgcttaatc agtgaggcac 5880 
ccatagttgc ctgactcccc gtcgtgtaga 5940 
gccccagtgc tgcaatgata ccgcgagacc 6000 
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cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca 6060 
gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta 6120 
gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg 6180 
tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc 6240 
gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg 6300 
ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt 6360 
ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt 6420 
cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata 6480 
ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc 6540 
gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac 6600 
ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa 6660 
ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct 6720 
tcctttttca atattattga agcatttatc agggttattg tctcatgagc ggatacatat 6780 
ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc 6840 
cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg 6900 
tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc 6960 
tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg gggcatccct ttagggttcc 7020 
gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta 7080 
gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta 7140 
atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg 7200 
atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa 7260 
aatttaacgc gaattttaac aaaatattaa acgtttacaa ttt 7303 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 16 

ccatggaaac ggtacccagc aggc 

<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 17 

cactccatcg ctatgatgaa ggg 

<210> 18 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 18 

agttcccttc atcatagcga tgg 

<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Primer used to amplify human DNA 
<400> 19 

aatgtacagg atgctggggt 

<210> 20 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 20 

cccttgtagc tggtggagtt ccctt 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 21 

tgtgtcatcg agctggcttc 

<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 22 

atcttctgca agtggctcca 
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Thr 


His 


Ser 


Ser Pro Ala Ala Ser 


Val Leu Pro His Pro Ala 


Met Asp 




610 




615 


620




Arg 


Pro 


Leu 


Gln Pro Gly Ser Ala


Thr Gly Ile Ala Tyr Asp


Pro Leu 


625 






630 


635 


640


Met 


Leu 


Lys 


His Gln Cys Val Cys


Gly Asn Ser Thr Thr His 


Pro Glu 








645 


650 


655


His 


Ala 


Gly 


Arg Ile Gln Ser Ile


Trp Ser Arg Leu Gln Glu


Thr Gly 








660 


665 670 




Leu 


Leu 


Asn 


Lys Cys Glu Arg Ile


Gln Gly Arg Lys Ala Ser


Leu Glu 






675 


680 


685 




Glu 


Ile


Gln


Leu Val His Ser Glu 


His His Ser Leu Leu Tyr 


Gly Thr 




690 




695 


700 




Asn 


Pro 


Leu 


Asp Gly Gln Lys Leu


Asp Pro Arg Ile Leu Leu


Gly Asp 


705 






710 


715 


720 


Asp 


Ser 


Gln


Lys Phe Phe Ser Ser 


Leu Pro Cys Gly Gly Leu 


Gly Val 








725 


730 


735 


Asp 


Ser 


Asp 


Thr Ile Trp Asn Glu


Leu His Ser Ser Gly Ala 


Ala Arg 








740 


745 750 




Met 


Ala 


Val 


Gly Cys Val Ile Glu


Leu Ala Ser Lys Val Ala 


Ser Gly 






755 


760 


765 




Glu 


Leu 


Lys 


Asn Gly Phe Ala Val 


Val Arg Pro Pro Gly His 


His Ala 




770 




775 


780 




Glu 


Glu 


Ser 


Thr Ala Met Gly Phe 


Cys Phe Phe Asn Ser Val 


Ala Ile


785 






790 


795 


800 


Thr 


Ala 


Lys 


Tyr Leu Arg Asp Gln


Leu Asn Ile Ser Lys Ile


Leu Ile








805 


810 


815 


Val 


Asp 


Leu 


Asp Val His His Gly 


Asn Gly Thr Gln Gln Ala


Phe Tyr 








820 


825 830 




Ala 


Asp 


Pro 


Ser Ile Leu Tyr Ile


Ser Leu His Arg Tyr Asp 


Glu Gly 






835 


840 


845 




Asn 


Phe 


Phe 


Pro Gly Ser Gly Ala 


Pro Asn Glu Val Arg Phe 


Ile Ser




850 




855 


860 




Leu 


Glu 


Pro 


His Phe Tyr Leu Tyr 


Leu Ser Gly Asn Cys Ile


Ala 


865 






870 


875 





<210> 5 

<211> 3054 

<212> DNA 

<213> Homo sapiens 

<400> 5 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200
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ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800 
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860 
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920 
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980 
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040 
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100 
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160 
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220 
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280 
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340 
gctgttgtga ggccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400 
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460 
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520 
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580 
ggagccccaa atgaggttgg aacaggcctt ggagaagggt acaatataaa tattgcctgg 2640 
acaggtggcc ttgatcctcc catgggagat gttgagtacc ttgaagcatt caggaccatc 2700 
gtgaagcctg tggccaaaga gtttgatcca gacatggtct tagtatctgc tggatttgat 2760 
gcattggaag gccacacccc tcctctagga gggtacaaag tgacggcaaa atgttttggt 2820 
catttgacga agcaattgat gacattggct gatggacgtg tggtgttggc tctagaagga 2880 
ggacatgatc tcacagccat ctgtgatgca tcagaagcct gtgtaaatgc ccttctagga 2940 
aatgagctgg agccacttgc agaagatatt ctccaccaaa gcccgaatat gaatgctgtt 3000 
atttctttac agaagatcat tgaaattcaa agtatgtctt taaagttctc ttaa 3054 

<210> 6 

<211> 967 

<212> PRT 

<213> Homo sapiens 

<400> 6 



Met 


His 


Ser 


Met 


Ile


Ser 


Ser 


Val 


Asp 


Val 


Lys 


Ser 


Glu 


Val 


Pro 


Val 


1 








5 










10 










15 




Gly 


Leu 


Glu 


Pro 
20 


Ile


Ser 


Pro 


Leu 


Asp 
25 


Leu 


Arg 


Thr 


Asp 


Leu 
30 


Arg 


Met 


Met 


Met 


Pro 
35 


Val 


Val 


Asp 


Pro 


Val 
40 


Val 


Arg 


Glu 


Lys 


Gln
45 


Leu 


Gln


Gln


Glu 


Leu 
50 


Leu 


Leu 


Ile


Gln


Gln
55 


Gln


Gln


Gln


Ile


Gln
60 


Lys 


Gln


Leu 


Leu 


Ile


Ala 


Glu 


Phe 


Gln


Lys 


Gln


His 


Glu 


Asn 


Leu 


Thr 


Arg 


Gln


His 


Gln


65 










70 










75 










80 


Ala 


Gln


Leu 


Gln


Glu 
85 


His 


Ile


Lys 


Glu 


Leu 
90 


Leu 


Ala 


Ile


Lys 


Gln
95 


Gln


Gln


Glu 


Leu 


Leu 
100 


Glu 


Lys 


Glu 


Gln


Lys 
105 


Leu 


Glu 


Gln


Gln


Arg 
110 


Gln


Glu 


Gln


Glu 


Val 
115 


Glu 


Arg 


His 


Arg 


Arg 
120 


Glu 


Gln


Gln


Leu 


Pro 
125 


Pro 


Leu 


Arg 


Gly 


Lys 
130 


Asp 


Arg 


Gly 


Arg 


Glu 
135 


Arg 


Ala 


Val 


Ala 


Ser 
140 


Thr 


Glu 


Val 


Lys 


Gln


Lys 


Leu 


Gln


Glu 


Phe 


Leu 


Leu 


Ser 


Lys 


Ser 


Ala 


Thr 


Lys 


Asp 


Thr 


145 
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155 
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Pro 


Thr 


Asn 


Gly 


Lys 


Asn 
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Ser 


Val 


Ser 


Arg 


His 


Pro 


Lys 


Leu 


Trp 
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Tyr 


Thr 


Ala 


Ala 
180 


His 


His 


Thr 


Ser 


Leu 
185 


Asp 


Gln


Ser 


Ser 


Pro 
190 


Pro 


Leu 
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Ser 


Gly 


Thr 


Ser Pro Ser Tyr 


Lys Tyr Thr 


Leu 


Pro 


Gly 


Ala 


Gln


Asp 
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Lys 
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Asp Phe Pro Leu 


Arg Lys 


Thr 
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Ser 


Ser 


Val 


Ser 


Ser 
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Ser 


Ser 


Pro 


Gly Ser Gly Pro 


Ser Ser 


Pro 


Asn 


Asn 


Gly 


Pro 


Thr 


Gly 


225 






230 
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Val 


Thr 


Glu Asn Glu Thr 


Ser Val 


Leu 


Pro 


Pro 


Thr 


Pro 


His 


Ala 
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255 




Glu 


Gln


Met 


Val Ser Gln Gln


Arg Ile


Leu 


Ile


His 


Glu 


Asp 


Ser 


Met 
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265 
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Leu 


Leu 


Ser Leu Tyr Thr 


Ser Pro 


Ser 


Leu 


Pro 


Asn 


Ile


Thr 


Leu 






275 




280 








285 
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Leu 


Pro 


Ala Val Pro Ser 


Gln Leu


Asn 


Ala 


Ser 


Asn 


Ser 


Leu 


Lys 




290 




295 
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Glu 


Lys 


Gln


Lys Cys Glu Thr 


Gln Thr


Leu 


Arg 


Gln


Gly 


Val 


Pro 


Leu 


305 






310 






315 










320 


Pro 


Gly 


Gln


Tyr Gly Gly Ser 


Ile Pro


Ala 


Ser 


Ser 


Ser 


His 


Pro 


His 
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330 










335 




Val 


Thr 


Leu 


Glu Gly Lys Pro 


Pro Asn 


Ser 


Ser 


His 


Gln


Ala 


Leu 


Leu 








340 


345 










350 






Gln


His 


Leu 


Leu Leu Lys Glu 


Gln Met Arg


Gln


Gln


Lys 


Leu 


Leu 


Val 
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360 








365 








Ala 


Gly 


Gly 


Val Pro Leu His 


Pro Gln


Ser 


Pro 


Leu 


Ala 


Thr 


Lys 


Glu 




370 




375 








380 










Arg 
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Ser 


Pro Gly Ile Arg


Gly Thr His 


Lys 


Leu 


Pro 


Arg 


His 


Arg 


385 






390 






395 










400 


Pro 


Leu 


Asn 


Arg Thr Gln Ser


Ala Pro 


Leu 


Pro 


Gln


Ser 


Thr 


Leu 


Ala 








405 




410 










415 




Gln


Leu 


Val 


Ile Gln Gln Gln


His Gln


Gln


Phe 


Leu 


Glu 


Lys 


Gln


Lys 








420 


425 










430 






Gln


Tyr 


Gln


Gln Gln Ile His


Met Asn Lys 


Leu 


Leu 


Ser 


Lys 


Ser 


Ile






435 




440 








445 








Glu 


Gln


Leu 


Lys Gln Pro Gly


Ser His 


Leu 


Glu 


Glu 


Ala 


Glu 


Glu 


Glu 




450 




455 








460 










Leu 


Gln


Gly 


Asp Gln Ala Met


Gln Glu


Asp 


Arg 


Ala 


Pro 


Ser 


Ser 


Gly 


465 






470 






475 










480 
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Ser 


Thr 


Arg Ser Asp Ser 


Ser Ala 


Cys 


Val 


Asp 


Asp 


Thr 


Leu 


Gly 








485 




490 










495 




Gln


Val 


Gly 


Ala Val Lys Val 


Lys Glu 


Glu 


Pro 


Val 


Asp 


Ser 


Asp 


Glu 








500 


505 










510 






Asp 


Ala 


Gln


Ile Gln Glu Met


Glu Ser Gly 


Glu 


Gln


Ala 


Ala 


Phe 


Met 






515 




520 








525 








Gln


Gln


Pro 


Phe Leu Glu Pro 


Thr His 


Thr 


Arg 


Ala 


Leu 


Ser 


Val 


Arg 




530 




535 








540 










Gln


Ala 


Pro 


Leu Ala Ala Val 


Gly Met Asp 


Gly 


Leu 


Glu 


Lys 


His 


Arg 


545 






550 






555 










560 


Leu 


Val 


Ser 


Arg Thr His Ser 


Ser Pro Ala 


Ala 


Ser 


Val 


Leu 


Pro 


His 








565 




570 










575 




Pro 


Ala 


Met 


Asp Arg Pro Leu 


Gln Pro Gly


Ser 


Ala 


Thr 


Gly 


Ile


Ala 








580 


585 










590 






Tyr 


Asp 


Pro 


Leu Met Leu Lys 


His Gln Cys


Val 


Cys 


Gly 


Asn 


Ser 


Thr 






595 




600 








605 








Thr 


His 


Pro 


Glu His Ala Gly 


Arg Ile


Gln


Ser 


Ile


Trp 


Ser 


Arg 


Leu 




610 




615 








620 










Gln


Glu 


Thr 


Gly Leu Leu Asn 


Lys Cys Glu 


Arg 


Ile


Gln


Gly 


Arg 


Lys 


625 






630 






635 










640 


Ala 


Ser 


Leu 


Glu Glu Ile Gln


Leu Val 


His 


Ser 


Glu 


His 


His 


Ser 


Leu 








645 




650 










655 




Leu 


Tyr 


Gly 


Thr Asn Pro Leu 


Asp Gly Gln


Lys 


Leu 


Asp 


Pro 


Arg 


Ile








660 


665 










670 






Leu 


Leu 


Gly 


Asp Asp Ser Gln


Lys Phe 


Phe 


Ser 


Ser 


Leu 


Pro 


Cys 


Gly 






675 




680 








685 
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Gly 


Leu 


Gly 


Val 


Asp 


Ser Asp 


Thr 




690 








695 




Gly 


Ala 


Ala 


Arg Met 


Ala Val 


Gly 


705 










710 




Val 


Ala 


Ser 


Gly Glu 


Leu Lys 


Asn 










725 






Gly 


His 


His 


Ala 


Glu 


Glu Ser 


Thr 








740 








Ser 


Val 


Ala 


Ile


Thr 


Ala Lys 


Tyr 






755 








760 


Lys 


Ile


Leu 


Ile Val


Asp Leu 


Asp 




770 








775 




Gln


Ala 


Phe 


Tyr 


Ala 


Asp Pro 


Ser 


785 










790 




Tyr 


Asp 


Glu 


Gly Asn 


Phe Phe 


Pro 










805 






Gly 


Thr 


Gly 


Leu Gly 


Glu Gly Tyr 








820 








Gly 


Leu 


Asp 


Pro 


Pro 


Met Gly Asp 






835 








840 


Thr 


Ile


Val 


Lys 


Pro 


Val Ala 


Lys 




850 








855 




Val 


Ser 


Ala 


Gly 


Phe 


Asp Ala 


Leu 


865 










870 




Gly 


Tyr 


Lys 


Val 


Thr 


Ala Lys 


Cys 










885 






Met 


Thr 


Leu 


Ala 


Asp 


Gly Arg Val 








900 








Asp 


Leu 


Thr 


Ala 


Ile


Cys Asp Ala 






915 








920 


Leu 


Gly 


Asn 


Glu 


Leu 


Glu Pro 


Leu 




930 








935 




Pro 


Asn 


Met 


Asn 


Ala 


Val Ile


Ser 


945 










950 




Ser 


Met 


Ser 


Leu Lys 


Phe Ser 












965 







Ile Trp


Asn 


Glu Leu 


His 


Ser 


Ser 






700 








Cys Val 


Ile


Glu Leu 


Ala 


Ser Lys 




715 








720 


Gly Phe 


Ala Val Val 


Arg 


Pro 


Pro 


730 








735 




Ala Met 


Gly Phe Cys 


Phe 


Phe 


Asn 


745 






750 






Leu Arg 


Asp Gln Leu


Asn 


Ile


Ser 






765 








Val His 


His 


Gly Asn 


Gly Thr Gln






780 








Ile Leu


Tyr 


Ile Ser


Leu 


His 


Arg 




795 








800 


Gly Ser 


Gly Ala Pro 


Asn 


Glu 


Val 


810 








815 




Asn Ile


Asn 


Ile Ala


Trp Thr Gly 


825 






830 






Val Glu 


Tyr 


Leu Glu 


Ala 


Phe 


Arg 






845 








Glu Phe 


Asp Pro Asp 


Met 


Val 


Leu 






860 








Glu Gly 


His 


Thr Pro 


Pro 


Leu Gly 




875 








880 


Phe Gly 


His 


Leu Thr 


Lys 


Gln


Leu 


890 








895 




Val Leu 


Ala 


Leu Glu 


Gly Gly His 


905 






910 






Ser Glu 


Ala 


Cys Val 


Asn 


Ala 


Leu 






925 








Ala Glu 


Asp 


Ile Leu


His 


Gln


Ser 






940 








Leu Gln


Lys 


Ile Ile


Glu 


Ile


Gln




955 








960 



<210> 7 

<211> 3367 

<212> DNA 

<213> Homo sapiens 

<400> 7 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 

tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 

ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 

aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 

aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300

cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360

cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 

ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 

caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 

gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 

ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 

cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 

ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 

gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840 

cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900 

ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960 

tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020 

cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080 
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acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140 
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200 
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260 
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320 
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380 
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440 
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500 
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560 
atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
cctttcctgg aacccacgca cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg 1800 
gttggcatgg atggattaga gaaacaccgt ctcgtctcca ggactcactc ttcccctgct 1860 
gcctctgttt tacctcaccc agcaatggac cgccccctcc agcctggctc tgcaactgga 1920 
attgcctatg accccttgat gctgaaacac cagtgcgttt gtggcaattc caccacccac 1980 
cctgagcatg ctggacgaat acagagtatc tggtcacgac tgcaagaaac tgggctgcta 2040 
aataaatgtg agcgaattca aggtcgaaaa gccagcctgg aggaaataca gcttgttcat 2100 
tctgaacatc actcactgtt gtatggcacc aaccccctgg acggacagaa gctggacccc 2160 
aggatactcc taggtgatga ctctcaaaag tttttttcct cattaccttg tggtggactt 2220 
ggggtggaca gtgacaccat ttggaatgag ctacactcgt ccggtgctgc acgcatggct 2280 
gttggctgtg tcatcgagct ggcttccaaa gtggcctcag gagagctgaa gaatgggttt 2340 
gctgttgtga ggccccctgg ccatcacgct gaagaatcca cagccatggg gttctgcttt 2400 
tttaattcag ttgcaattac cgccaaatac ttgagagacc aactaaatat aagcaagata 2460 
ttgattgtag atctggatgt tcaccatgga aacggtaccc agcaggcctt ttatgctgac 2520 
cccagcatcc tgtacatttc actccatcgc tatgatgaag ggaacttttt ccctggcagt 2580 
ggagccccaa atgaggttcg gtttatttct ttagagcccc acttttattt gtatctttca 2640 
ggtaattgca ttgcatgatt acccctaatt ttcttgtcct ttgctggtgt tttaaattac 2700 
acgagattac tgaattgtcc catgggacca agaaccagtg cagaacaagt gcataaccca 2760 
gagcactgtt tgtcagggaa ggttgggctg atttgatgtg ttgtttgatg tttatttcaa 2820 
gagctcccat gtgcttgttt tcctctcttc ttgctttctt ccatttgctc tcttctctgc 2880 
ccaccgtggt gtgtctttct cttcccaggt tggaacaggc cttggagaag ggtacaatat 2940 
aaatattgcc tggacaggtg gccttgatcc tcccatggga gatgttgagt accttgaagc 3000 
attcaggacc atcgtgaagc ctgtggccaa agagtttgat ccagacatgg tcttagtatc 3060
tgctggattt gatgcattgg aaggccacac ccctcctcta ggagggtaca aagtgacggc 3120 
aaaatgtttt ggtcatttga cgaagcaatt gatgacattg gctgatggac gtgtggtgtt 3180 
ggctctagaa ggaggacatg atctcacagc catctgtgat gcatcagaag cctgtgtaaa 3240 
tgcccttcta ggaaatgagc tggagccact tgcagaagat attctccacc aaagcccgaa 3300 
tatgaatgct gttatttctt tacagaagat cattgaaatt caaagtatgt ctttaaagtt 3360 
ctcttaa 3367 

<210> 8 

<211> 835 

<212> PRT 

<213> Homo sapiens 

<400> 8 
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55 60 
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Ala 


Glu 
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His 
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70 75 
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Ala 
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Leu 


Gln


Glu 


His Ile Lys Glu Leu Leu Ala


Ile Lys


Gln


Gln










85 


90 


95 




Gln


Glu 


Leu 


Leu 


Glu 


Lys Glu Gln Lys Leu Glu Gln


Gln Arg


Gln


Glu 








100 




105 


110 






Gln


Glu 


Val 


Glu 


Arg 


His Arg Arg Glu Gln Gln Leu


Pro Pro 


Leu 


Arg 






115 






120 


125 
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Ser 


Gly


Thr 


Ser Pro Ser Tyr Lys 






195




200 


Ala


Lys 


Asp 


Asp Phe Pro Leu 


Arg 




210
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Ser 


Ser 


Pro 


Gly Ser Gly Pro 


Ser 
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230 




Ser 


Val 


Thr 


Glu Asn Glu Thr 


Ser 








245 




Glu 


Gln


Met 


Val Ser Gln Gln Arg








260 




Asn 


Leu 


Leu 


Ser Leu Tyr Thr 


Ser 






275 




280 


Gly 


Leu 


Pro 


Ala Val Pro Ser 


Gln




290 




295 




Glu 


Lys 


Gln


Lys Cys Glu Thr 


Gln


305 






310 




Pro 


Gly 


Gln


Tyr Gly Gly Ser 


Ile








325 




Val 


Thr 


Leu 


Glu Gly Lys Pro 


Pro 








340 




Gln


His 


Leu 


Leu Leu Lys Glu 


Gln






355 




360 


Ala 


Gly 


Gly 


Val Pro Leu His 


Pro 




370 




375 




Arg 


Ile


Ser 


Pro Gly Ile Arg Gly


385 






390 




Pro 


Leu 


Asn 


Arg Thr Gln Ser


Ala 








405 




Gln


Leu 


Val 


Ile Gln Gln Gln


His 








420 




Gln


Tyr 


Gln


Gln Gln Ile His


Met 






435 




440 
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Gin 


Leu 


Lys Gin Pro Gly Ser 
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Gin 
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Asp Gin Ala Met Gin 
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Ser 
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bin 
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Pro 


Phe Leu Glu Pro 
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530 
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Ala 


Pro 
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Val 


Ser 


Arg Thr His Ser 
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Met 


Asp Arg Pro Leu 


Gln
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Pro 


Leu Met Leu Lys 


His 
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Thr 


His 


Pro 


Glu His Ala Gly Arg 




610 
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Tyr 
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Pro Gly Ala Gin Asp 
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Tnr 
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Ser Ser Val 


Ser 


Ser 
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Ser 


Pro 
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Asn Gly Pro 


Thr 


Gly 
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Val 
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Pro Thr Pro 


His 
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He 
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Leu Glu Lys 
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Ala Pro Ser 
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Glu 
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Ser 


Gly 


Glu 


Gln Ala Ala


Phe 


Met 
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Ala Leu Ser Val 


Arg 








540 






Met 


Asp 


Gly 


Leu Glu Lys 


His 


Arg 






555 
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Pro 


Ala 
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Ser Val Leu 


Pro 


His 




570 
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Pro 


Gly 


Ser 


Ala Thr Gly 


Ile


Ala 
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Gin 


Cys 


Val 


Cys Gly Asn 


Ser 


Thr 
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He 


Gin 


Ser 


He Trp Ser 
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rile 


Ser 


oer jjeu Jrro uys v^xy 




o /o 


OOU 






QOJ 


Gly Leu Gly Val Asp Ser Asp Thr Ile Trp Asn Glu Leu His Ser Ser
690 695 700

Gly Ala Ala Arg Met Ala Val Gly Cys Val Ile Glu Leu Ala Ser Lys
705 710 715 720

Val Ala Ser Gly Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro
725 730 735

Gly His His Ala Glu Glu Ser Thr Ala Met Gly Phe Cys Phe Phe Asn
740 745 750



Ser Val Ala Ile Thr Ala Lys Tyr Leu Arg Asp Gln Leu Asn Ile Ser
755 760 765

Lys Ile Leu Ile Val Asp Leu Asp Val His His Gly Asn Gly Thr Gln
770 775 780

Gln Ala Phe Tyr Ala Asp Pro Ser Ile Leu Tyr Ile Ser Leu His Arg
785 790 795 800

Tyr Asp Glu Gly Asn Phe Phe Pro Gly Ser Gly Ala Pro Asn Glu Val
805 810 815

Arg Phe Ile Ser Leu Glu Pro His Phe Tyr Leu Tyr Leu Ser Gly Asn
820 825 830

Cys Ile Ala
835











<210> 9 

<211> 1791 

<212> DNA 

<213> Homo sapiens 

<400> 9 

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780
gatgatttcc cccttcgaaa aactgaatcc tcagtcagta gcagttctcc aggctctggt 840
cccagttcac caaacaatgg gccaactgga agtgttactg aaaatgagac ttcggttttg 900
ccccctaccc ctcatgccga gcaaatggtt tcacagcaac gcattctaat tcatgaagat 960
tccatgaacc tgctaagtct ttatacctct ccttctttgc ccaacattac cttggggctt 1020
cccgcagtgc catcccagct caatgcttcg aattcactca aagaaaagca gaagtgtgag 1080
acgcagacgc ttaggcaagg tgttcctctg cctgggcagt atggaggcag catcccggca 1140
tcttccagcc accctcatgt tactttagag ggaaagccac ccaacagcag ccaccaggct 1200
ctcctgcagc atttattatt gaaagaacaa atgcgacagc aaaagcttct tgtagctggt 1260
ggagttccct tacatcctca gtctcccttg gcaacaaaag agagaatttc acctggcatt 1320
agaggtaccc acaaattgcc ccgtcacaga cccctgaacc gaacccagtc tgcacctttg 1380
cctcagagca cgttggctca gctggtcatt caacagcaac accagcaatt cttggagaag 1440
cagaagcaat accagcagca gatccacatg aacaaactgc tttcgaaatc tattgaacaa 1500
ctgaagcaac caggcagtca ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg 1560






atgcaggaag acagagcgcc ctctagtggc aacagcacta ggagcgacag cagtgcttgt 1620 
gtggatgaca cactgggaca agttggggct gtgaaggtca aggaggaacc agtggacagt 1680 
gatgaagatg ctcagatcca ggaaatggaa tctggggagc aggctgcttt tatgcaacag 1740 
gtaataggca aagatttagc tccaggattt gtaattaaag tcattatctg a 1791 

<210> 10 
<211> 546 
<212> PRT 

<213> Homo sapiens 



<400> 10 



Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val
1 5 10 15

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met
20 25 30

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln
35 40 45

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu
50 55 60

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln
65 70 75 80

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln
85 90 95

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu
100 105 110

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg
115 120 125

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys
130 135 140

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr
145 150 155 160

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp
165 170 175

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu
180 185 190

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp
195 200 205

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Glu Ser Ser Val Ser Ser
210 215 220

Ser Ser Pro Gly Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly
225 230 235 240

Ser Val Thr Glu Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala
245 250 255

Glu Gln Met Val Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met
260 265 270

Asn Leu Leu Ser Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu
275 280 285

Gly Leu Pro Ala Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys
290 295 300

Glu Lys Gln Lys Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu
305 310 315 320

Pro Gly Gln Tyr Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His
325 330 335

Val Thr Leu Glu Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu
340 345 350

Gln His Leu Leu Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val
355 360 365

Ala Gly Gly Val Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu
370 375 380

Arg Ile Ser Pro Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg
385 390 395 400

Pro Leu Asn Arg Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala
405 410 415

Gln Leu Val Ile Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys
420 425 430

Gln Tyr Gln Gln Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile
435 440 445

Glu Gln Leu Lys Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu
450 455 460

Leu Gln Gly Asp Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly
465 470 475 480

Asn Ser Thr Arg Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly
485 490 495

Gln Val Gly Ala Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu
500 505 510

Asp Ala Gln Ile Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met
515 520 525

Gln Gln Val Ile Gly Lys Asp Leu Ala Pro Gly Phe Val Ile Lys Val
530 535 540

Ile Ile
545



<210> 11 
<211> 590 
<212> PRT 

<213> Homo sapiens 



<400> 11 



Met His Ser Met Ile Ser Ser Val Asp Val Lys Ser Glu Val Pro Val
1 5 10 15

Gly Leu Glu Pro Ile Ser Pro Leu Asp Leu Arg Thr Asp Leu Arg Met
20 25 30

Met Met Pro Val Val Asp Pro Val Val Arg Glu Lys Gln Leu Gln Gln
35 40 45

Glu Leu Leu Leu Ile Gln Gln Gln Gln Gln Ile Gln Lys Gln Leu Leu
50 55 60

Ile Ala Glu Phe Gln Lys Gln His Glu Asn Leu Thr Arg Gln His Gln
65 70 75 80

Ala Gln Leu Gln Glu His Ile Lys Glu Leu Leu Ala Ile Lys Gln Gln
85 90 95

Gln Glu Leu Leu Glu Lys Glu Gln Lys Leu Glu Gln Gln Arg Gln Glu
100 105 110

Gln Glu Val Glu Arg His Arg Arg Glu Gln Gln Leu Pro Pro Leu Arg
115 120 125

Gly Lys Asp Arg Gly Arg Glu Arg Ala Val Ala Ser Thr Glu Val Lys
130 135 140

Gln Lys Leu Gln Glu Phe Leu Leu Ser Lys Ser Ala Thr Lys Asp Thr
145 150 155 160

Pro Thr Asn Gly Lys Asn His Ser Val Ser Arg His Pro Lys Leu Trp
165 170 175

Tyr Thr Ala Ala His His Thr Ser Leu Asp Gln Ser Ser Pro Pro Leu
180 185 190

Ser Gly Thr Ser Pro Ser Tyr Lys Tyr Thr Leu Pro Gly Ala Gln Asp
195 200 205

Ala Lys Asp Asp Phe Pro Leu Arg Lys Thr Ala Ser Glu Pro Asn Leu
210 215 220

Lys Val Arg Ser Arg Leu Lys Gln Lys Val Ala Glu Arg Arg Ser Ser
225 230 235 240

Pro Leu Leu Arg Arg Lys Asp Gly Asn Val Val Thr Ser Phe Lys Lys
245 250 255

Arg Met Phe Glu Val Thr Glu Ser Ser Val Ser Ser Ser Ser Pro Gly
260 265 270

Ser Gly Pro Ser Ser Pro Asn Asn Gly Pro Thr Gly Ser Val Thr Glu
275 280 285

Asn Glu Thr Ser Val Leu Pro Pro Thr Pro His Ala Glu Gln Met Val
290 295 300

Ser Gln Gln Arg Ile Leu Ile His Glu Asp Ser Met Asn Leu Leu Ser
305 310 315 320

Leu Tyr Thr Ser Pro Ser Leu Pro Asn Ile Thr Leu Gly Leu Pro Ala
325 330 335

Val Pro Ser Gln Leu Asn Ala Ser Asn Ser Leu Lys Glu Lys Gln Lys
340 345 350

Cys Glu Thr Gln Thr Leu Arg Gln Gly Val Pro Leu Pro Gly Gln Tyr
355 360 365

Gly Gly Ser Ile Pro Ala Ser Ser Ser His Pro His Val Thr Leu Glu
370 375 380

Gly Lys Pro Pro Asn Ser Ser His Gln Ala Leu Leu Gln His Leu Leu
385 390 395 400

Leu Lys Glu Gln Met Arg Gln Gln Lys Leu Leu Val Ala Gly Gly Val
405 410 415

Pro Leu His Pro Gln Ser Pro Leu Ala Thr Lys Glu Arg Ile Ser Pro
420 425 430

Gly Ile Arg Gly Thr His Lys Leu Pro Arg His Arg Pro Leu Asn Arg
435 440 445

Thr Gln Ser Ala Pro Leu Pro Gln Ser Thr Leu Ala Gln Leu Val Ile
450 455 460

Gln Gln Gln His Gln Gln Phe Leu Glu Lys Gln Lys Gln Tyr Gln Gln
465 470 475 480

Gln Ile His Met Asn Lys Leu Leu Ser Lys Ser Ile Glu Gln Leu Lys
485 490 495

Gln Pro Gly Ser His Leu Glu Glu Ala Glu Glu Glu Leu Gln Gly Asp
500 505 510

Gln Ala Met Gln Glu Asp Arg Ala Pro Ser Ser Gly Asn Ser Thr Arg
515 520 525

Ser Asp Ser Ser Ala Cys Val Asp Asp Thr Leu Gly Gln Val Gly Ala
530 535 540

Val Lys Val Lys Glu Glu Pro Val Asp Ser Asp Glu Asp Ala Gln Ile
545 550 555 560

Gln Glu Met Glu Ser Gly Glu Gln Ala Ala Phe Met Gln Gln Val Ile
565 570 575

Gly Lys Asp Leu Ala Pro Gly Phe Val Ile Lys Val Ile Ile
580 585 590




<210> 12 


















<211> 1084 


















<212> PRT 


















<213> Homo sapiens 














<400> 12 


















Met Ser Ser Gln Ser His Pro Asp Gly Leu Ser Gly Arg Asp Gln Pro
1 5 10 15

Val Glu Leu Leu Asn Pro Ala Arg Val Asn His Met Pro Ser Thr Val
20 25 30

Asp Val Ala Thr Ala Leu Pro Leu Gln Val Ala Pro Ser Ala Val Pro
35 40 45

Met Asp Leu Arg Leu Asp His Gln Phe Ser Leu Pro Val Ala Glu Pro
50 55 60

Ala Leu Arg Glu Gln Gln Leu Gln Gln Glu Leu Leu Ala Leu Lys Gln
65 70 75 80

Lys Gln Gln Ile Gln Arg Gln Ile Leu Ile Ala Glu Phe Gln Arg Gln
85 90 95

His Glu Gln Leu Ser Arg Gln His Glu Ala Gln Leu His Glu His Ile
100 105 110

Lys Gln Gln Gln Glu Met Leu Ala Met Lys His Gln Gln Glu Leu Leu
115 120 125

Glu His Gln Arg Lys Leu Glu Arg His Arg Gln Glu Gln Glu Leu Glu
130 135 140

Lys Gln His Arg Glu Gln Lys Leu Gln Gln Leu Lys Asn Lys Glu Lys
145 150 155 160

Gly Lys Glu Ser Ala Val Ala Ser Thr Glu Val Lys Met Lys Leu Gln
165 170 175

Glu Phe Val Leu Asn Lys Lys Lys Ala Leu Ala His Arg Asn Leu Asn
180 185 190

His Cys Ile Ser Ser Asp Pro Arg Tyr Trp Tyr Gly Lys Thr Gln His
195 200 205

Ser Ser Leu Asp Gln Ser Ser Pro Pro Gln Ser Gly Val Ser Thr Ser
210 215 220

Tyr Asn His Pro Val Leu Gly Met Tyr Asp Ala Lys Asp Asp Phe Pro
225 230 235 240

Leu Arg Lys Thr Ala Ser Glu Pro Asn Leu Lys Leu Arg Ser Arg Leu
245 250 255

Lys Gln Lys Val Ala Glu Arg Arg Ser Ser Pro Leu Leu Arg Arg Lys
260 265 270

Asp Gly Pro Val Val Thr Ala Leu Lys Lys Arg Pro Leu Asp Val Thr
275 280 285

Asp Ser Ala Cys Ser Ser Ala Pro Gly Ser Gly Pro Ser Ser Pro Asn
290 295 300

Asn Ser Ser Gly Ser Val Ser Ala Glu Asn Gly Ile Ala Pro Ala Val
305 310 315 320

Pro Ser Ile Pro Ala Glu Thr Ser Leu Ala His Arg Leu Val Ala Arg
325 330 335

Glu Gly Ser Ala Ala Pro Leu Pro Leu Tyr Thr Ser Pro Ser Leu Pro
340 345 350

Asn Ile Thr Leu Gly Leu Pro Ala Thr Gly Pro Ser Ala Gly Thr Ala
355 360 365

Gly Gln Gln Asp Thr Glu Arg Leu Thr Leu Pro Ala Leu Gln Gln Arg
370 375 380

Leu Ser Leu Phe Pro Gly Thr His Leu Thr Pro Tyr Leu Ser Thr Ser
385 390 395 400

Pro Leu Glu Arg Asp Gly Gly Ala Ala His Ser Pro Leu Leu Gln His
405 410 415

Met Val Leu Leu Glu Gln Pro Pro Ala Gln Ala Pro Leu Val Thr Gly
420 425 430

Leu Gly Ala Leu Pro Leu His Ala Gln Ser Leu Val Gly Ala Asp Arg
435 440 445

Val Ser Pro Ser Ile His Lys Leu Arg Gln His Arg Pro Leu Gly Arg
450 455 460

Thr Gln Ser Ala Pro Leu Pro Gln Asn Ala Gln Ala Leu Gln His Leu
465 470 475 480

Val Ile Gln Gln Gln His Gln Gln Phe Leu Glu Lys His Lys Gln Gln
485 490 495

Phe Gln Gln Gln Gln Leu Gln Met Asn Lys Ile Ile Pro Lys Pro Ser
500 505 510

Glu Pro Ala Arg Gln Pro Glu Ser His Pro Glu Glu Thr Glu Glu Glu
515 520 525

Leu Arg Glu His Gln Ala Leu Leu Asp Glu Pro Tyr Leu Asp Arg Leu
530 535 540

Pro Gly Gln Lys Glu Ala His Ala Gln Ala Gly Val Gln Val Lys Gln
545 550 555 560

Glu Pro Ile Glu Ser Asp Glu Glu Glu Ala Glu Pro Pro Arg Glu Val
565 570 575

Glu Pro Gly Gln Arg Gln Pro Ser Glu Gln Glu Leu Leu Phe Arg Gln
580 585 590

Gln Ala Leu Leu Leu Glu Gln Gln Arg Ile His Gln Leu Arg Asn Tyr
595 600 605

Gln Ala Ser Met Glu Ala Ala Gly Ile Pro Val Ser Phe Gly Gly His
610 615 620

Arg Pro Leu Ser Arg Ala Gln Ser Ser Pro Ala Ser Ala Thr Phe Pro
625 630 635 640

Val Ser Val Gln Glu Pro Pro Thr Lys Pro Arg Phe Thr Thr Gly Leu
645 650 655

Val Tyr Asp Thr Leu Met Leu Lys His Gln Cys Thr Cys Gly Ser Ser
660 665 670

Ser Ser His Pro Glu His Ala Gly Arg Ile Gln Ser Ile Trp Ser Arg
675 680 685

Leu Gln Glu Thr Gly Leu Arg Gly Lys Cys Glu Cys Ile Arg Gly Arg
690 695 700

Lys Ala Thr Leu Glu Glu Leu Gln Thr Val His Ser Glu Ala His Thr
705 710 715 720

Leu Leu Tyr Gly Thr Asn Pro Leu Asn Arg Gln Lys Leu Asp Ser Lys
725 730 735

Lys Leu Leu Gly Ser Leu Ala Ser Val Phe Val Arg Leu Pro Cys Gly
740 745 750

Gly Val Gly Val Asp Ser Asp Thr Ile Trp Asn Glu Val His Ser Ala
755 760 765

Gly Ala Ala Arg Leu Ala Val Gly Cys Val Val Glu Leu Val Phe Lys
770 775 780

Val Ala Thr Gly Glu Leu Lys Asn Gly Phe Ala Val Val Arg Pro Pro
785 790 795 800

Gly His His Ala Glu Glu Ser Thr Pro Met Gly Phe Cys Tyr Phe Asn
805 810 815

Ser Val Ala Val Ala Ala Lys Leu Leu Gln Gln Arg Leu Ser Val Ser
820 825 830

Lys Ile Leu Ile Val Asp Trp Asp Val His His Gly Asn Gly Thr Gln
835 840 845

Gln Ala Phe Tyr Ser Asp Pro Ser Val Leu Tyr Met Ser Leu His Arg
850 855 860

Tyr Asp Asp Gly Asn Phe Phe Pro Gly Ser Gly Ala Pro Asp Glu Val
865 870 875 880

Gly Thr Gly Pro Gly Val Gly Phe Asn Val Asn Met Ala Phe Thr Gly
885 890 895

Gly Leu Asp Pro Pro Met Gly Asp Ala Glu Tyr Leu Ala Ala Phe Arg
900 905 910

Thr Val Val Met Pro Ile Ala Ser Glu Phe Ala Pro Asp Val Val Leu
915 920 925

Val Ser Ser Gly Phe Asp Ala Val Glu Gly His Pro Thr Pro Leu Gly
930 935 940

Gly Tyr Asn Leu Ser Ala Arg Cys Phe Gly Tyr Leu Thr Lys Gln Leu
945 950 955 960

Met Gly Leu Ala Gly Gly Arg Ile Val Leu Ala Leu Glu Gly Gly His
965 970 975

Asp Leu Thr Ala Ile Cys Asp Ala Ser Glu Ala Cys Val Ser Ala Leu
980 985 990

Leu Gly Asn Glu Leu Asp Pro Leu Pro Glu Lys Val Leu Gln Gln Arg
995 1000 1005

Pro Asn Ala Asn Ala Val Arg Ser Met Glu Lys Val Met Glu Ile His
1010 1015 1020

Ser Lys Tyr Trp Arg Cys Leu Gln Arg Thr Thr Ser Thr Ala Gly Arg
1025 1030 1035 1040

Ser Leu Ile Glu Ala Gln Thr Cys Glu Asn Glu Glu Ala Glu Thr Val
1045 1050 1055

Thr Ala Met Ala Ser Leu Ser Val Gly Val Lys Pro Ala Glu Lys Arg
1060 1065 1070

Pro Asp Glu Glu Pro Met Glu Glu Glu Pro Pro Leu
1075 1080

<210> 13
<211> 3550
<212> DNA
<213> Homo sapiens
<400> 13

ggggaagaga ggcacagaca cagataggag aagggcaccg gctggagcca cttgcaggac 60 
tgagggtttt tgcaacaaaa ccctagcagc ctgaagaact ctaagccaga tggggtggct 120 
ggacgagagc agctcttggc tcagcaaaga atgcacagta tgatcagctc agtggatgtg 180 
aagtcagaag ttcctgtggg cctggagccc atctcacctt tagacctaag gacagacctc 240 
aggatgatga tgcccgtggt ggaccctgtt gtccgtgaga agcaattgca gcaggaatta 300 
cttcttatcc agcagcagca acaaatccag aagcagcttc tgatagcaga gtttcagaaa 360 
cagcatgaga acttgacacg gcagcaccag gctcagcttc aggagcatat caaggaactt 420 
ctagccataa aacagcaaca agaactccta gaaaaggagc agaaactgga gcagcagagg 480 
caagaacagg aagtagagag gcatcgcaga gaacagcagc ttcctcctct cagaggcaaa 540 
gatagaggac gagaaagggc agtggcaagt acagaagtaa agcagaagct tcaagagttc 600 
ctactgagta aatcagcaac gaaagacact ccaactaatg gaaaaaatca ttccgtgagc 660 
cgccatccca agctctggta cacggctgcc caccacacat cattggatca aagctctcca 720 
ccccttagtg gaacatctcc atcctacaag tacacattac caggagcaca agatgcaaag 780 
gatgatttcc cccttcgaaa aactgcctct gagcccaact tgaaggtgcg gtccaggtta 840 
aaacagaaag tggcagagag gagaagcagc cccttactca ggcggaagga tggaaatgtt 900 
gtcacttcat tcaagaagcg aatgtttgag gtgacagaat cctcagtcag tagcagttct 960 
ccaggctctg gtcccagttc accaaacaat gggccaactg gaagtgttac tgaaaatgag 1020 
acttcggttt tgccccctac ccctcatgcc gagcaaatgg tttcacagca acgcattcta 1080 
attcatgaag attccatgaa cctgctaagt ctttatacct ctccttcttt gcccaacatt 1140 
accttggggc ttcccgcagt gccatcccag ctcaatgctt cgaattcact caaagaaaag 1200 
cagaagtgtg agacgcagac gcttaggcaa ggtgttcctc tgcctgggca gtatggaggc 1260 
agcatcccgg catcttccag ccaccctcat gttactttag agggaaagcc acccaacagc 1320 
agccaccagg ctctcctgca gcatttatta ttgaaagaac aaatgcgaca gcaaaagctt 1380 
cttgtagctg gtggagttcc cttacatcct cagtctccct tggcaacaaa agagagaatt 1440 
tcacctggca ttagaggtac ccacaaattg ccccgtcaca gacccctgaa ccgaacccag 1500 
tctgcacctt tgcctcagag cacgttggct cagctggtca ttcaacagca acaccagcaa 1560 
ttcttggaga agcagaagca ataccagcag cagatccaca tgaacaaact gctttcgaaa 1620 
tctattgaac aactgaagca accaggcagt caccttgagg aagcagagga agagcttcag 1680 
ggggaccagg cgatgcagga agacagagcg ccctctagtg gcaacagcac taggagcgac 1740 
agcagtgctt gtgtggatga cacactggga caagttgggg ctgtgaaggt caaggaggaa 1800 
ccagtggaca gtgatgaaga tgctcagatc caggaaatgg aatctgggga gcaggctgct 1860 
tttatgcaac aggtaatagg caaagattta gctccaggat ttgtaattaa agtcattatc 1920 
tgacctttcc tggaacccac gcacacacgt gcgctctctg tgcgccaagc tccgctggct 1980 
gcggttggca tggatggatt agagaaacac cgtctcgtct ccaggactca ctcttcccct 2040 
gctgcctctg ttttacctca cccagcaatg gaccgccccc tccagcctgg ctctgcaact 2100 
ggaattgcct atgacccctt gatgctgaaa caccagtgcg tttgtggcaa ttccaccacc 2160 
caccctgagc atgctggacg aatacagagt atctggtcac gactgcaaga aactgggctg 2220 
ctaaataaat gtgagcgaat tcaaggtcga aaagccagcc tggaggaaat acagcttgtt 2280 
cattctgaac atcactcact gttgtatggc accaaccccc tggacggaca gaagctggac 2340 
cccaggatac tcctaggtga tgactctcaa aagttttttt cctcattacc ttgtggtgga 2400 
cttggggtgg acagtgacac catttggaat gagctacact cgtccggtgc tgcacgcatg 2460 
gctgttggct gtgtcatcga gctggcttcc aaagtggcct caggagagct gaagaatggg 2520 
tttgctgttg tgaggccccc tggccatcac gctgaagaat ccacagccat ggggttctgc 2580 
ttttttaatt cagttgcaat taccgccaaa tacttgagag accaactaaa tataagcaag 2640 
atattgattg tagatctgga tgttcaccat ggaaacggta cccagcaggc cttttatgct 2700 
gaccccagca tcctgtacat ttcactccat cgctatgatg aagggaactt tttccctggc 2760 
agtggagccc caaatgaggt tcggtttatt tctttagagc cccactttta tttgtatctt 2820 
tcaggtaatt gcattgcatg attaccccta attttcttgt cctttgctgg tgttttaaat 2880 
tacacgagat tactgaattg tcccatggga ccaagaacca gtgcagaaca agtgcataac 2940 
ccagagcact gtttgtcagg gaaggttggg ctgatttgat gtgttgtttg atgtttattt 3000 
caagagctcc catgtgcttg ttttcctctc ttcttgcttt cttccatttg ctctcttctc 3060 
tgcccaccgt ggtgtgtctt tctcttccca ggttggaaca ggccttggag aagggtacaa 3120 
tataaatatt gcctggacag gtggccttga tcctcccatg ggagatgttg agtaccttga 3180 
agcattcagg accatcgtga agcctgtggc caaagagttt gatccagaca tggtcttagt 3240 
atctgctgga tttgatgcat tggaaggcca cacccctcct ctaggagggt acaaagtgac 3300 
ggcaaaatgt tttggtcatt tgacgaagca attgatgaca ttggctgatg gacgtgtggt 3360 
gttggctcta gaaggaggac atgatctcac agccatctgt gatgcatcag aagcctgtgt 3420
aaatgccctt ctaggaaatg agctggagcc acttgcagaa gatattctcc accaaagccc 3480
gaatatgaat gctgttattt ctttacagaa gatcattgaa attcaaagta tgtctttaaa 3540
gttctcttaa 3550

<210> 14
<211> 7699
<212> DNA
<213> Homo sapiens
<400> 14

cccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccca 120
gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctgat ctaatcaata 180
ttggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 240
attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 300
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 360
agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 420
cgaccgccca gcgacccccg cccgttgacg tcaatagtga cgtatgttcc catagtaacg 480
ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg 540
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacggtaaa 600
tggcccgcct agcattatgc ccagtacatg accttacggg agtttcctac ttggcagtac 660
atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta caccaatggg 720
cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg 780
agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaataa ccccgccccg 840
ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcgttta 900
gtgaaccgtc agaattcaag cttgcggccg cagatctatc gatctgcagg atatcaccat 960
gcacagtatg atcagctcag tggatgtgaa gtcagaagtt cctgtgggcc tggagcccat 1020
ctcaccttta gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttgt 1080
ccgtgagaag caattgcagc aggaattact tcttatccag cagcagcaac aaatccagaa 1140
gcagcttctg atagcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200
tcagcttcag gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260
aaaggagcag aaactggagc agcagaggca agaacaggaa gtagagaggc atcgcagaga 1320
acagcagctt cctcctctca gaggcaaaga tagaggacga gaaagggcag tggcaagtac 1380
agaagtaaag cagaagcttc aagagttcct actgagtaaa tcagcaacga aagacactcc 1440
aactaatgga aaaaatcatt ccgtgagccg ccatcccaag ctctggtaca cggctgccca 1500
ccacacatca ttggatcaaa gctctccacc ccttagtgga acatctccat cctacaagta 1560
cacattacca ggagcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctga 1620
gcccaacttg aaggtgcggt ccaggttaaa acagaaagtg gcagagagga gaagcagccc 1680
cttactcagg cggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaggt 1740
gacagaatcc tcagtcagta gcagttctcc aggctctggt cccagttcac caaacaatgg 1800
gccaactgga agtgttactg aaaatgagac ttcggttttg ccccctaccc ctcatgccga 1860
gcaaatggtt tcacagcaac gcattctaat tcatgaagat tccatgaacc tgctaagtct 1920
ttatacctct ccttctttgc ccaacattac cttggggctt cccgcagtgc catcccagct 1980
caatgcttcg aattcactca aagaaaagca gaagtgtgag acgcagacgc ttaggcaagg 2040
tgttcctctg cctgggcagt atggaggcag catcccggca tcttccagcc accctcatgt 2100
tactttagag ggaaagccac ccaacagcag ccaccaggct ctcctgcagc atttattatt 2160
gaaagaacaa atgcgacagc aaaagcttct tgtagctggt ggagttccct tacatcctca 2220
gtctcccttg gcaacaaaag agagaatttc acctggcatt agaggtaccc acaaattgcc 2280
ccgtcacaga cccctgaacc gaacccagtc tgcacctttg cctcagagca cgttggctca 2340
gctggtcatt caacagcaac accagcaatt cttggagaag cagaagcaat accagcagca 2400
gatccacatg aacaaactgc tttcgaaatc tattgaacaa ctgaagcaac caggcagtca 2460
ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg atgcaggaag acagagcgcc 2520
ctctagtggc aacagcacta ggagcgacag cagtgcttgt gtggatgaca cactgggaca 2580
agttggggct gtgaaggtca aggaggaacc agtggacagt gatgaagatg ctcagatcca 2640
ggaaatggaa tctggggagc aggctgcttt tatgcaacag cctttcctgg aacccacgca 2700
cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg gttggcatgg atggattaga 2760
gaaacaccgt ctcgtctcca ggactcactc ttcccctgct gcctctgttt tacctcaccc 2820
agcaatggac cgccccctcc agcctggctc tgcaactgga attgcctatg accccttgat 2880
gctgaaacac cagtgcgttt gtggcaattc caccacccac cctgagcatg ctggacgaat 2940
acagagtatc tggtcacgac tgcaagaaac tgggctgcta aataaatgtg agcgaattca 3000
aggtcgaaaa gccagcctgg aggaaataca gcttgttcat tctgaacatc actcactgtt 3060
gtatggcacc aaccccctgg acggacagaa gctggacccc aggatactcc taggtgatga 3120
ctctcaaaag tttttttcct cattaccttg tggtggactt ggggtggaca gtgacaccat 3180
ttggaatgag ctacactcgt ccggtgctgc acgcatggct gttggctgtg tcatcgagct 3240 
ggcttccaaa gtggcctcag gagagctgaa gaatgggttt gctgttgtga ggccccctgg 3300 
ccatcacgct gaagaatcca cagccatggg gttctgcttt tttaattcag ttgcaattac 3360 
cgccaaatac ttgagagacc aactaaatat aagcaagata ttgattgtag atctggatgt 3420 
tcaccatgga aacggtaccc agcaggcctt ttatgctgac cccagcatcc tgtacatttc 3480 
actccatcgc tatgatgaag ggaacttttt ccctggcagt ggagccccaa atgaggttgg 3540 
aacaggcctt ggagaagggt acaatataaa tattgcctgg acaggtggcc ttgatcctcc 3600 
catgggagat gttgagtacc ttgaagcatt caggaccatc gtgaagcctg tggccaaaga 3660 
gtttgatcca gacatggtct tagtatctgc tggatttgat gcattggaag gccacacccc 3720 
tcctctagga gggtacaaag tgacggcaaa atgttttggt catttgacga agcaattgat 3780 
gacattggct gatggacgtg tggtgttggc tctagaagga ggacatgatc tcacagccat 3840 
ctgtgatgca tcagaagcct gtgtaaatgc ccttctagga aatgagctgg agccacttgc 3900 
agaagatatt ctccaccaaa gcccgaatat gaatgctgtt atttctttac agaagatcat 3960 
tgaaattcaa agtatgtctt taaagttctc tggatccggt accagattac aaggacgacg 4020 
atgacaagta gatcccgggt ggcatccctg tgacccctcc ccagtgcctc tcctggcctt 4080 
ggaagttgcc actccagtgc ccaccagcct tgtcctaata aaattaagtt gcatcatttt 4140 
gtctgactag gtgtcctcta taatattatg gggtggaggg gggtggtatg gagcaagggg 4200 
cccaagttgg gaagacaacc tgtagggcct gcggggtcta ttcgggaacc aagctggagt 4260 
gcagtggcac aatcttggct cactgcaatc tccgcctcct gggttcaagc gattctcctg 4320 
cctcagcctc ccgagttgtt gggattccag gcatgcatga ccaggctcag ctaatttttg 4380
tttttttggt agagacgggg tttcaccata ttggccaggc tggtctccaa ctcctaatct 4440 
caggtgatct acccaccttg gcctcccaaa ttgctgggat tacaggcgtg aaccactgct 4500 
cccttccctg tccttctgat tttaaaataa ctataccagc aggaggacgt ccagacacag 4560 
cataggctac ctgccatggc ccaaccggtg ggacatttga gttgcttgct tggcactgtc 4620 
ctctcatgcg ttgggtccac tcagtagatg cctgttgaat tgggtacgcg gccagcttct 4680 
gtggaatgtg tgtcagttag ggtgtggaaa gtccccaggc tccccagcag gcagaagtat 4740 
gcaaagcatg catctcaatt agtcagcaac caggtgtgga aaagtcccca ggctccccag 4800 
caggcagaag tatgcaaagc atgcatctca attagtcagc aaccatagtc ccgcccctaa 4860 
ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc catggctgac 4920 
taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta ttccagaagt 4980 
agtgaggagg cttttttgga ggcctaggct tttgcaaaaa gctcctcgag gaactgaaaa 5040 
accagaaagt taattcccta tagtgagtcg tattaaattc gtaatcatgg tcatagctgt 5100 
ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 5160 
agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 5220 
tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 5280 
cggggagagg cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc 5340 
gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat 5400 
ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 5460 
ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc 5520 
atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 5580 
aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 5640 
gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcaatgc tcacgctgta 5700 
ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 5760 
ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 5820 
acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag 5880 
gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga agaacagtat 5940 
ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat 6000 
ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc 6060 
gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt 6120 
ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct 6180 
agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt 6240 
ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc 6300 
gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac 6360 
catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat 6420 
cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg 6480 
cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata 6540 
gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta 6600 
tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 6660 
gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag 6720 
tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa 6780 
gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 6840 
gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt 6900 
taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc 6960 
tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta 7020 
ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa 7080 
taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca 7140 
tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 7200 
aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgcgccc tgtagcggcg 7260 
cattaagcgc ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc 7320 
tagcgcccgc tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc 7380 
gtcaagctct aaatcggggc atccctttag ggttccgatt tagtgcttta cggcacctcg 7440 
accccaaaaa acttgattag ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg 7500 
tttttcgccc tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg 7560 
gaacaacact caaccctatc tcggtctatt cttttgattt ataagggatt ttgccgattt 7620 
cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa 7680 
tattaaacgt ttacaattt 7699 



<210> 15 

<211> 7303 

<212> DNA 

<213> Homo sapiens 

<400> 15 

cccattcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 60 
tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccca 120 
gggttttccc agtcacgacg ttgtaaaacg acggccagtg ccaagctgat ctaatcaata 180 
ttggccatta gccatattat tcattggtta tatagcataa atcaatattg gctattggcc 240 
attgcatacg ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt 300 
accgccatgt tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt 360 
agttcatagc ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg 420 
cgaccgccca gcgacccccg cccgttgacg tcaatagtga cgtatgttcc catagtaacg 480 
ccaataggga ctttccattg acgtcaatgg gtggagtatt tacggtaaac tgcccacttg 540 
gcagtacatc aagtgtatca tatgccaagt ccgcccccta ttgacgtcaa tgacggtaaa 600 
tggcccgcct agcattatgc ccagtacatg accttacggg agtttcctac ttggcagtac 660 
atctacgtat tagtcatcgc tattaccatg gtgatgcggt tttggcagta caccaatggg 720 
cgtggatagc ggtttgactc acggggattt ccaagtctcc accccattga cgtcaatggg 780 
agtttgtttt ggcaccaaaa tcaacgggac tttccaaaat gtcgtaataa ccccgccccg 840 
ttgacgcaaa tgggcggtag gcgtgtacgg tgggaggtct atataagcag agctcgttta 900 
gtgaaccgtc agaattcaag cttgcggccg cagatctatc gatctgcagg atatcaccat 960 
gcacagtatg atcagctcag tggatgtgaa gtcagaagtt cctgtgggcc tggagcccat 1020 
ctcaccttta gacctaagga cagacctcag gatgatgatg cccgtggtgg accctgttgt 1080 
ccgtgagaag caattgcagc aggaattact tcttatccag cagcagcaac aaatccagaa 1140 
gcagcttctg atagcagagt ttcagaaaca gcatgagaac ttgacacggc agcaccaggc 1200 
tcagcttcag gagcatatca aggaacttct agccataaaa cagcaacaag aactcctaga 1260 
aaaggagcag aaactggagc agcagaggca agaacaggaa gtagagaggc atcgcagaga 1320 
acagcagctt cctcctctca gaggcaaaga tagaggacga gaaagggcag tggcaagtac 1380 
agaagtaaag cagaagcttc aagagttcct actgagtaaa tcagcaacga aagacactcc 1440 
aactaatgga aaaaatcatt ccgtgagccg ccatcccaag ctctggtaca cggctgccca 1500 
ccacacatca ttggatcaaa gctctccacc ccttagtgga acatctccat cctacaagta 1560 
cacattacca ggagcacaag atgcaaagga tgatttcccc cttcgaaaaa ctgcctctga 1620 
gcccaacttg aaggtgcggt ccaggttaaa acagaaagtg gcagagagga gaagcagccc 1680 
cttactcagg cggaaggatg gaaatgttgt cacttcattc aagaagcgaa tgtttgaggt 1740 
gacagaatcc tcagtcagta gcagttctcc aggctctggt cccagttcac caaacaatgg 1800 
gccaactgga agtgttactg aaaatgagac ttcggttttg ccccctaccc ctcatgccga 1860 
gcaaatggtt tcacagcaac gcattctaat tcatgaagat tccatgaacc tgctaagtct 1920 
ttatacctct ccttctttgc ccaacattac cttggggctt cccgcagtgc catcccagct 1980 
caatgcttcg aattcactca aagaaaagca gaagtgtgag acgcagacgc ttaggcaagg 2040 
tgttcctctg cctgggcagt atggaggcag catcccggca tcttccagcc accctcatgt 2100 
tactttagag ggaaagccac ccaacagcag ccaccaggct ctcctgcagc atttattatt 2160 
gaaagaacaa atgcgacagc aaaagcttct tgtagctggt ggagttccct tacatcctca 2220 
gtctcccttg gcaacaaaag agagaatttc acctggcatt agaggtaccc acaaattgcc 2280 
ccgtcacaga cccctgaacc gaacccagtc tgcacctttg cctcagagca cgttggctca 2340 
gctggtcatt caacagcaac accagcaatt cttggagaag cagaagcaat accagcagca 2400 
gatccacatg aacaaactgc tttcgaaatc tattgaacaa ctgaagcaac caggcagtca 2460 
ccttgaggaa gcagaggaag agcttcaggg ggaccaggcg atgcaggaag acagagcgcc 2520 
ctctagtggc aacagcacta ggagcgacag cagtgcttgt gtggatgaca cactgggaca 2580 
agttggggct gtgaaggtca aggaggaacc agtggacagt gatgaagatg ctcagatcca 2640 
ggaaatggaa tctggggagc aggctgcttt tatgcaacag cctttcctgg aacccacgca 2700 
cacacgtgcg ctctctgtgc gccaagctcc gctggctgcg gttggcatgg atggattaga 2760 
gaaacaccgt ctcgtctcca ggactcactc ttcccctgct gcctctgttt tacctcaccc 2820 
agcaatggac cgccccctcc agcctggctc tgcaactgga attgcctatg accccttgat 2880 
gctgaaacac cagtgcgttt gtggcaattc caccacccac cctgagcatg ctggacgaat 2940 
acagagtatc tggtcacgac tgcaagaaac tgggctgcta aataaatgtg agcgaattca 3000 
aggtcgaaaa gccagcctgg aggaaataca gcttgttcat tctgaacatc actcactgtt 3060 
gtatggcacc aaccccctgg acggacagaa gctggacccc aggatactcc taggtgatga 3120 
ctctcaaaag tttttttcct cattaccttg tggtggactt ggggtggaca gtgacaccat 3180 
ttggaatgag ctacactcgt ccggtgctgc acgcatggct gttggctgtg tcatcgagct 3240 
ggcttccaaa gtggcctcag gagagctgaa gaatgggttt gctgttgtga ggccccctgg 3300 
ccatcacgct gaagaatcca cagccatggg gttctgcttt tttaattcag ttgcaattac 3360 
cgccaaatac ttgagagacc aactaaatat aagcaagata ttgattgtag atctggatgt 3420 
tcaccatgga aacggtaccc agcaggcctt ttatgctgac cccagcatcc tgtacatttc 3480 
actccatcgc tatgatgaag ggaacttttt ccctggcagt ggagccccaa atgaggttcg 3540 
gtttatttct ttagagcccc acttttattt gtatctttca ggtaattgca ttgcaggatc 3600 
cggtaccaga ttacaaggac gacgatgaca agtagatccc gggtggcatc cctgtgaccc 3660 
ctccccagtg cctctcctgg ccttggaagt tgccactcca gtgcccacca gccttgtcct 3720 
aataaaatta agttgcatca ttttgtctga ctaggtgtcc tctataatat tatggggtgg 3780 
aggggggtgg tatggagcaa ggggcccaag ttgggaagac aacctgtagg gcctgcgggg 3840 
tctattcggg aaccaagctg gagtgcagtg gcacaatctt ggctcactgc aatctccgcc 3900 
tcctgggttc aagcgattct cctgcctcag cctcccgagt tgttgggatt ccaggcatgc 3960 
atgaccaggc tcagctaatt tttgtttttt tggtagagac ggggtttcac catattggcc 4020 
aggctggtct ccaactccta atctcaggtg atctacccac cttggcctcc caaattgctg 4080 
ggattacagg cgtgaaccac tgctcccttc cctgtccttc tgattttaaa ataactatac 4140 
cagcaggagg acgtccagac acagcatagg ctacctgcca tggcccaacc ggtgggacat 4200 
ttgagttgct tgcttggcac tgtcctctca tgcgttgggt ccactcagta gatgcctgtt 4260 
gaattgggta cgcggccagc ttctgtggaa tgtgtgtcag ttagggtgtg gaaagtcccc 4320 
aggctcccca gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccaggtg 4380 
tggaaaagtc cccaggctcc ccagcaggca gaagtatgca aagcatgcat ctcaattagt 4440 
cagcaaccat agtcccgccc ctaactccgc ccatcccgcc cctaactccg cccagttccg 4500 
cccattctcc gccccatggc tgactaattt tttttattta tgcagaggcc gaggccgcct 4560 
cggcctctga gctattccag aagtagtgag gaggcttttt tggaggccta ggcttttgca 4620 
aaaagctcct cgaggaactg aaaaaccaga aagttaattc cctatagtga gtcgtattaa 4680 
attcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgctc acaattccac 4740 
acaacatacg agccggaagc ataaagtgta aagcctgggg tgcctaatga gtgagctaac 4800 
tcacattaat tgcgttgcgc tcactgcccg ctttccagtc gggaaacctg tcgtgccagc 4860 
tgcattaatg aatcggccaa cgcgcgggga gaggcggttt gcgtattggg cgctcttccg 4920 
cttcctcgct cactgactcg ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc 4980 
actcaaaggc ggtaatacgg ttatccacag aatcagggga taacgcagga aagaacatgt 5040 
gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc 5100 
ataggctccg cccccctgac gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa 5160 
acccgacagg actataaaga taccaggcgt ttccccctgg aagctccctc gtgcgctctc 5220 
ctgttccgac cctgccgctt accggatacc tgtccgcctt tctcccttcg ggaagcgtgg 5280 
cgctttctca atgctcacgc tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc 5340 
tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg cgccttatcc ggtaactatc 5400 
gtcttgagtc caacccggta agacacgact tatcgccact ggcagcagcc actggtaaca 5460 
ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact 5520 
acggctacac tagaagaaca gtatttggta tctgcgctct gctgaagcca gttaccttcg 5580 
gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt 5640 
ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct 5700 
tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga 5760 
gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa 5820 
tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac 5880 
ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga 5940 
taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc 6000 
cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca 6060 
gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta 6120 
gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg 6180 
tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc 6240 
gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg 6300 
ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt 6360 
ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt 6420 
cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata 6480 
ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc 6540 
gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac 6600 
ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa 6660 
ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct 6720 
tcctttttca atattattga agcatttatc agggttattg tctcatgagc ggatacatat 6780 
ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc 6840 
cacctgacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt acgcgcagcg 6900 
tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc ccttcctttc 6960 
tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg gggcatccct ttagggttcc 7020 
gatttagtgc tttacggcac ctcgacccca aaaaacttga ttagggtgat ggttcacgta 7080 
gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc acgttcttta 7140 
atagtggact cttgttccaa actggaacaa cactcaaccc tatctcggtc tattcttttg 7200 
atttataagg gattttgccg atttcggcct attggttaaa aaatgagctg atttaacaaa 7260 
aatttaacgc gaattttaac aaaatattaa acgtttacaa ttt 7303 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 16 

ccatggaaac ggtacccagc aggc 24 

<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 17 

cactccatcg ctatgatgaa ggg 23 

<210> 18 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 18 

agttcccttc atcatagcga tgg 23 

<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 19 

aatgtacagg atgctggggt 20 

<210> 20 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 20 

cccttgtagc tggtggagtt ccctt 25 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 21 

tgtgtcatcg agctggcttc 20 

<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer used to amplify human DNA 
<400> 22 

atcttctgca agtggctcca 20 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
27 December 2002 (27.12.2002) 

(10) International Publication Number 
WO 02/102984 A3 



(51) International Patent Classification⁷: C12N 9/78, 9/00, 9/14, 1/20, 15/00, C07H 21/04 

(21) International Application Number: PCT/US02/19051 

(22) International Filing Date: 14 June 2002 (14.06.2002) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 
60/298,173 14 June 2001 (14.06.2001) US 
60/311,686 10 August 2001 (10.08.2001) US 
60/316,995 4 September 2001 (04.09.2001) US 



(71) Applicant (for all designated States except US): 
SLOAN-KETTERING INSTITUTE FOR CANCER RESEARCH [US/US]; 
1275 York Avenue, New York, NY 10021 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): RICHON, Victoria 
[US/US]; 160 Theodore Fremd Street, #A11, Rye, NY 
10580 (US). ZHOU, Xianbo [CN/US]; 43 Bradley Street, 
Dobbs Ferry, NY 10522 (US). RIFKIND, Richard, A. 
[US/US]; 425 East 58th Street, #48A, New York, NY 10022 
(US). MARKS, Paul, A. [US/US]; 7 Rossiter Road, 
Washington, CT 06793 (US). 



(74) Agents: BROOK, David, E. et al.; Hamilton, Brook, 
Smith & Reynolds, P.C., 530 Virginia Road, P.O. Box 
9133, Concord, MA 01742-9133 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, OM, PH, PL, PT, RO, RU, SD, SE, SG, 
SI, SK, SL, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, 
VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAPI patent 
(BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR, 
NE, SN, TD, TG). 

Published: 

— with international search report 

(88) Date of publication of the international search report: 

13 November 2003 

For two-letter codes and other abbreviations, refer to the "Guidance 
Notes on Codes and Abbreviations" appearing at the beginning 
of each regular issue of the PCT Gazette. 



(54) Title: HDAC9 POLYPEPTIDES AND POLYNUCLEOTIDES AND USES THEREOF 

(57) Abstract: The present invention features substantially pure HDAC9, HDAC9a, HDAC9(ΔNLS), HDAC9a(ΔNLS), and 
HDRP(ΔNLS) polypeptides, and isolated nucleic acid molecules encoding those polypeptides. The present invention also features 
vectors containing HDAC9, HDAC9a, HDAC9(ΔNLS), HDAC9a(ΔNLS), and HDRP(ΔNLS) nucleic acid sequences, and cells 
containing those vectors. 



INTERNATIONAL SEARCH REPORT 


International application No. 
PCT/US02/19051 




A. CLASSIFICATION OF SUBJECT MATTER 

IPC(7) : C12N 9/78, 9/00, 9/14, 1/20, 15/00; C07H 21/04 

US CL : 435/227, 183, 195, 252.3, 320.1; 536/23.2 
According to International Patent Classification (IPC) or to both national classification and IPC 




B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S. : 435/227, 183, 195, 252.3, 320.1; 536/23.2 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
STN AND WEST. Sequence search in Swissprot, EST, N-GeneSeq, PIR_71 , SPTREMBL & issued US patents. 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



A 



NAGASE et al. Prediction of Coding Sequences of Unidentified Human Genes. XI. The 
Complete Sequences of 100 New cDNA Clones from Brain Which Code for Large 
Proteins in Vitro. DNA Research November 1998, Vol 5, pages 277-286. See Table 1, 
Accession No. AB018287 is 58.8% similar to DNA sequence of SEQ ID NO: 1, 
claim 4(g). 

ZHOU et al. Cloning and Characterization of a histone deacetylase, HDAC9. PNAS, 11 
September 2001, Vol. 98, No. 19, pages 10572-10577. 

WANG et al. HDAC4, a Human Histone Deacetylase Related to Yeast HDA1, Is a 
Transcriptional Corepressor. Molecular and Cellular Biology, November 1999, Vol. 19, 
No. 11, pages 7816-7827. 



1-9, 29 
1-9, 29 



[ ] Further documents are listed in the continuation of Box C. [ ] See patent family annex. 



Special categories of cited documents: 

"A" document defining the general state of the art which is not considered to be 
of particular relevance 

"E" earlier application or patent published on or after the international filing date 

"L" document which may throw doubts on priority claim(s) or which is cited to 
establish the publication date of another citation or other special reason (as 
specified) 

"O" document referring to an oral disclosure, use, exhibition or other means 

"P" document published prior to the international filing date but later than the 
priority date claimed 

"T" later document published after the international filing date or priority 
date and not in conflict with the application but cited to understand the 
principle or theory underlying the invention 

"X" document of particular relevance; the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

"Y" document of particular relevance; the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 

"&" document member of the same patent family 



Date of the actual completion of the international search 



30 October 2002 (30.10.2002) 



Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 

Facsimile No. (703)305-3230 



Date of mailing of the international search report 



f efoinfcd Saidha 

No. (703)308-0196 




Form PCT/ISA/210 (second sheet) (July 1998) 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US02/19051 



Box I Observations where certain claims were found unsearchable (Continuation of Item 1 of first sheet) 



This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. [ ] Claim Nos.: 
because they relate to subject matter not required to be searched by this Authority, namely: 

2. [ ] Claim Nos.: 
because they relate to parts of the international application that do not comply with the prescribed requirements to 
such an extent that no meaningful international search can be carried out, specifically: 

3. [ ] Claim Nos.: 
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 
6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Continuation Sheet 



1. [ ] As all required additional search fees were timely paid by the applicant, this international search report covers all 
searchable claims. 

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 
payment of any additional fee. 

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this international search 
report covers only those claims for which fees were paid, specifically claims Nos.: 

4. [X] No required additional search fees were timely paid by the applicant. Consequently, this international search report 
is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 1-9 & 29 (SEQ ID NOS: 1 & 2) 

Remark on Protest 
[ ] The additional search fees were accompanied by the applicant's protest. 
[ ] No protest accompanied the payment of additional search fees. 
BOX II. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING 

This application contains the following inventions or groups of inventions which are not so linked as to form a single general 
inventive concept under PCT Rule 13.1. In order for all inventions to be examined, the appropriate additional examination fees must 
be paid. 

Group I, claim(s) 1-9, 29, drawn to isolated nucleic acid, the encoded protein and protein composition. 
Group II, claim(s) 10, drawn to antibody. 

Group III, claim(s) 11-13, drawn to a method of identifying a compound that modulates DNA expression. 

Group IV, claim(s) 14-19, 33, drawn to a method of identifying a compound that modulates enzymatic activity. 

Group V, claim(s) 20-25, 34, drawn to a method of identifying a compound that modulates transcriptional repression activity of the 
polypeptide. 

Group VI, claim(s) 26-27, drawn to a method of identifying a compound that modulates expression of a nucleic acid molecule. 

Group VII, claim(s) 28, drawn to a method of identifying a polypeptide that interacts with a polypeptide of claim 1 in a two-hybrid 
system. 

Group VIII, claim(s) 30-32, drawn to a method of diagnosing a cell proliferation disease. 

This application contains claims directed to more than one species of the generic invention. These species are deemed to lack unity of 
invention because they are not so linked as to form a single general inventive concept under PCT Rule 13.1. 

In order for more than one species to be examined, the appropriate additional examination fees must be paid. The species are as 
follows: 

1. SEQ ID NO: 1 and 2 [HDAC9]. 

2. SEQ ID NO: 3 and 4 [HDAC9a]. 

3. SEQ ID NO: 5 and 6 [HDAC9-ΔNLS]. 

4. SEQ ID NO: 7 and 8 [HDAC9a-ΔNLS]. 

5. SEQ ID NO: 9 and 10 [HDRP-ΔNLS]. 

The claims are deemed to correspond to the species listed above in the following manner: 

Each of the claims listed in Groups I-VIII corresponds to each of the 5 species, which are structurally distinct. 

The following claim(s) are generic: 1-5. 

The inventions listed as Groups I-VIII do not relate to a single general inventive concept under PCT Rule 13.1 because, under PCT 
Rule 13.2, they lack the same or corresponding special technical features for the following reasons: Group I has a special technical 
feature of the nucleotide sequence encoding a specific histone deacetylase which Groups II-VIII do not share; Group II has a special 
technical feature of the antibody to a specific histone deacetylase which Groups I & III-VIII do not share; Groups III-VIII employ 
nucleic acid or polypeptide in various methods of identifying compounds or polypeptides for distinct uses. Further, in view of 37 CFR 
1.475(b), when claims corresponding to different categories of inventions are present then only (3) applies and additional methods of 
use are deemed to lack unity. 

The species listed above do not relate to a single general inventive concept under PCT Rule 13.1 because, under PCT Rule 13.2, the 
species lack the same or corresponding special technical features for the following reasons: The various species correspond to nucleic 
acid and polypeptide sequences which are distinct from each other in structure and in activity, and therefore lack the same or 
corresponding special technical feature. 